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TOXTN GEN ? ™M XENORHABDUS NEMATQPHIfrUS . 

Technical Field 

The present invention concerns the identification 
and isolation of a new class of protein toxins specific 
against insects which are produced by bacteria from the 
species Xenorhabdus nematophilus and possibly by the 
species X.beddingii. In addition, the present invention 
relates to the insertion of this class of toxin into 
recombinant viruses, bacteria, protozoa, fungi, and 
transgenic plants in order to broaden the use of these 
toxins for control of a large range of insect pests and 
plant parasitic nematodes. 
Background 

Insect pathogenic nematodes of the family 

15 Steinernematidae are known to be symbiotically associated 
with bacteria of the genus Xenorhabdus. It has been 
observed that these bacteria have the ability to kill a 
wide range of different insects without the aid of their 
nematode partners. 

20 The present inventors have identified a new class of 

toxins. A DMA fragment encoding one of these toxins has 
been isolated from Xenorhabdus nematophilus stain A24 and 
characterised by sequencing. As will be recogised by 
persons skilled in the art, DNA fragments encoding members 

25 of this new class of toxins may be usefully introduced 
into viral agents, including entomopox and nuclear 
polyhedrosis viruses; bacteria (including Gracilicutes, 
Firmicutes, Tenericutes and Mendosicutes) -, fungi; 
protozoa; and plants. 

30 Summary of the Present Invention 

in a first aspect, the present invention consists in 
a polynucleotide molecule comprising a nucleotide sequence 
which encodes an insecticidal toxin and which is at least 
70% homologous to the nucleotide sequence shown in Table 1 
35 from residue 83 to 919, or a fragment thereof which 
fragment encodes an insecticidal polypeptide. 
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In a preferred embodiment of the present invention 
the nucleotide sequence is at least 90% to the sequence 
shown in Table 1 from residue 83 to 919. 

Preferably, the nucleotide sequence which encodes an 
5 insecticidal toxin from Xenorhabdus and more preferably, 
the nucleotide sequence substantially corresponds to the 
sequence shown in Table 1 from residue 83 to 919. 

In a second aspect the present invention provides in 
an insecticidal toxin which includes an amino acid 

10 sequence which is at least 70% homologous to residues 1 to 
278 shown in Table 2 or a functional fragment thereof. 

In a preferred embodiment of the present invention 
the insecticidal toxin includes an amino acid sequence 
which is at least 90% homologous to residues 1 to 278 

15 shown in Table 1 or a functional fragment thereof. 

In a further preferred embodiment the insecticidal 
toxin includes an amino acid sequence substantially 
corresponding to residues 1 to 278 in Table 1 or a 
functional fragment thereof. 

20 In a third aspect the present invention provides in 

a recombinant organism, the organism being characterised 
in that it is transformed with the polynucleotide molecule 
of the first aspect of the present invention. 

The organisms which may be usefully transformed with 

25 the polynucleotide molecule of the first aspect of the 
present invention include viral agents such as entomopox 
and nuclear polyhedrosis viruses; bacteria, such as 
Gracillcutes , Firmicutes, Tenericutes and Nendoslcutes; 
fungi; protozoa; and plants. 

30 The term "substantially corresponds" as used herein 

in relation to the nucleotide sequence is intended to 
encompass minor variations in the nucleotide sequence 
which due to degeneracy do not result in a change in the 
encoded protein. Further this term is intended to 

35 encompass other minor variations in the sequence which may 
be required to enhance expression in a particular system 
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but in which the variations do not result in a decrease in 
biological activity of the encoded protein. 

The term "substantially corresponding" is used 
herein in relation to the amino acid sequence is intended 
5 to encompass minor variations in the amino acid sequence 
which do not result in a decrease in biological activity 
of the insecticidal toxin. These variations may include 
conservative amino acid substitutions. The substitutions 
envisaged are:- 

10 G, A, V, I, L, M; D, E; N, Q; S f T; K, R, H; F, 

Y f W, H; and P, fla-alkal amino acids. 

As used herein the term "functional fragments" is 
intended to encompass fragments of the insecticidal toxin 
which retain insecticidal activity. 

15 In a fourth aspect, the present invention provides a 

method for controlling the proliferation of insects, 
comprising applying to an infested area a recombinant 
organism according to the third aspect optionally in 
admixture with an acceptable agricultural carrier. 

20 Isolation and Characterisation of a Toxin from 

Xenorhabdus nematophilus A24 
Generation of a Cosmid Library 

Genomic DNA from Xenorhabdus nematophilus A24, 
isolated using the method of Marmur (1961) was 

25 partially digested using the restriction enzyme Sau 3A, to 
generate fragments of DNA that were in the size range of 
30 to 50 kilobasepairs (kb), and dephosphorylated using 
the enzyme calf alkaline phosphatase. The cosmid 
"Supercos" (Stratagene) was prepared to receive foreign 

30 insert DNA into its Bam HI cloning site according to the 
manufacturer's instructions. The digested DNA from 
X . nematophilus A24 was added to the cosmid DNA in a ratio 
of 3:1 and ligated together using the enzyme T4 DNA 
ligase. The ligated material was subsequently packaged 

35 into ^-bacteriophage using the Gigapack II XL Packaging 
Extract (Stratagene) as per the manufacturer's 
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instructions. The packaged DNA was subsequently 
transfected into the Escherichia coli strain NM554 (F-, 
recA . araD139,A (ara, leu) 7696, Alac Y74, galU -, galK-, 
hsr , hsm +. strA . mcrA f-] , mcr Bf-"| . Bacteria were plated 
5 out onto Luria Bertani (LB) agar plates containing 150 \ig 
ml* 1 ampicillin to select for those bacteria containing 
recombinant Supercos plasmids. 
Screening for Toxin Producing Clones 

Individual clones were grown overnight at 28°C in LB 

10 containing 150 jig ml* 1 ampicillin. Cultures were treated 
for 15 minutes with 2mg ml" 1 lysozyme in order to release 
any proteins produced by the recombinant DNA into the 
medium. Five |il aliquots of this solution were then 
injected directly into the haemocoel of three Gallerla 

15 mellonella fourth instar larvae. Appropriate controls 
containing lysozyme and non-recombinant E.coll NM554 
cultures were also injected to confirm the absence of any 
toxicity to these larvae. Two clones were found to have 
strong insecticidal activity. Injected larvae were found 

20 to be very sluggish after 30 hours, with all larvae dead 
within three days. 

Characterisation of Toxin Producing Clones 

The recombinant Supercos DNA from these clones was 
isolated using an alkaline lysis procedure (Maniatis et 

25 al,, 1982). Isolated DNA was digested with varying 

restriction enzymes and analysed using TAE agarose gel 
electrophoresis (Maniatis et al, 1982). It was found that 
both clones were identical and contained a 34.6 kb DNA 
insert from X* nematophilus A24. One of these clones 

30 cosl49 was chosen for further study. 

A 7.4kb Bam HI fragment from cos 14 9 was cloned into 
the plasmid vector pGEM7Z(f)+ (Promega) which was 
transformed into the E.coll strain DH5a (F-, 3>80dlac ZA 
M15, recAl, endA l, fl2xA96 r thi -1, lisdR17[rK-f sup 

35 E44. relA l, deoR, A[lacZYA-argF] U169) using 

electroporation at 25|iF, 200£2 and 2.5kV in a 0.2cm 
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cuvette in a Bio-Rad Gene Pulser. This -clone (N8pGEM) was 
found to continue to be toxic against G.mellonella larvae. 

Plasmid DNA from N8pGEM was isolated and digested 
with the restriction enzymes Cla l and Sph I . This resulted 
5 in the linearization of this plasmid containing one end 
(3') which was resistant to digestion by the enzyme 
Exonuclease III and the other end (5') which could be 
digested at a constant rate of 450 bases per minute at 
37°C by this enzyme using the Erase-a-Base kit from 

10 Promega. Using this enzyme aliquots containing decreasing 
size plasmids were obtained which were recircularised 
using the enzyme T4 DNA ligase. Recircularised plasmids 
were reintroduced into the bacterium E.coli strain DH5a 
using electroporation (see above). Varying size clones 

15 were selected and used for injecting G.mellonella larvae. 
The smallest clone which continued to be insect icidal was 
found to contain 1.5kb of X . nematophilus A24 DNA and was 
. designated tox 1. 

Plasmid DNA from tox 1 was isolated and digested 

20 with the restriction enzymes Sac I and Hind lll, 

respectively to again create linear molecules with one end 
resistant and the other sensitive to digestion with 
Exonuclease III. Deletion mutants were isolated and 
tested against G.mellonella larvae. A clone which now 

25 only contained 1.2kb of X. nematophilus A24 DNcc was 

isolated and was toxic against our test insect. This 
clone was designated toxb4 . 

The recombinant plasmids from toxb4 and three 
further (non-toxic) deletion clones, toxb5 f toxb6 and 

30 toxb7, were isolated and used for obtaining the sequence 
of both strands of the toxin gene. Sequencing was 
performed using the Applied Biosystems, Incorporated Model 
3?0 automated sequencer . Sequencing templates were 
prepared using double stranded DNA templates and the 21M13 

35 and SP6 primer sites located on the pGEM7Z(f )+ plasmid and 
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using the Taq dye primer cycle sequencing protocol 
(Applied Biosystems, Incorporated). 

The toxin gene was found to consist of an 834 
basepair open reading frame (Table 1) which translates 
5 into a 278 amino acid protein (Table 2). The start of the 
toxin gene sequence was preceded by appropriate DNA 
promoters necessary for transcription of the gene into a 
mRNA molecule prior to its synthesis into a peptide. 
These consist of a Shine-Dalgarno poly-purine sequence and 
10 -10 and -35 RNA polymerase recognition sequences (Table 

1). 

The DNA sequence and the derived amino acid 
sequences were analysed by sequence data bank analyses to 
determine if any other related sequences have previously 
15 been identified. The results indicated that no other 

sequence exists in the GenBank and EMBL data banks which 

has any similarity to this gene and its product. 

Cloning of Xenorhabdus Toxin into a High-Expression Vector 

Using the determined DNA sequence, 20-mer DNA 
20 primers were designed to cover the 5' and 3' region of the 
toxin gene and thus allow PCR amplification of the toxin 
and subsequent insertion into an expression vector. These 
primers included linker regions containing appropriate 
restriction enzyme sites ( Cla l and Nde l for the 5' primer 
25 and Bam HI for the 3' primer). 

5' primer CCATCGATCATATGGTTATTAAACC 

3' primer CGGGATCCTTATCTCTAAGGTTTTT 

Utilising a standard PCR protocol (Innis, M. A. , 
Gelford, D.H., Sminsky, J.J. and White, T.J.: (1990). PCR 
30 Protocols : A Guide to Methods and Applications. Academic 
Press, San Diego. 482pp) the toxin was amplified out of 
the genome of X . nematophilus A24 and restriction digested 
with Cla I and Bam HI. The digested fragment was 
subsequently ligated into pGEM-7Zf (+) and then subcloned 
35 from this vector into the high expression vector 

pT7T2b( derived from pETll [Novagen] and carrying the T7 
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promoter upstream from the start of the toxin insert; 
constructed by Dr. Karl Gordon, CSIRO/ Division of 
Entomology) using the restriction enzyme sites Nde I and 
Bam HI. The recombinant plasmid was transformed into the 
5 E. coli strain BL21(DE3) fF- omp T r B -m B which carries in 
its chromosome the T7 RNA polymerase gene under lac UV5 
control). Induction of the toxin may be achieved by the 
addition of 0.4mM IPTG at mid-exponential phase of the 
culture and continuing the incubation for an extra 4 
10 hours . 

In vitro expression of the 1.2 Kb insert fragment 
from toxb4 was achieved with the E.coli S30 Extract 
Procaryotic Translation System for linear DNA. Only a 
30kDa peptide was produced indicating that the 1.2 Kb 

15 fragment encodes one peptide only - the insect toxin. 
Southern Blot Hybridization of a Range of 
Xenorhabdus spp. and Photohabdus Luminescens Strains with 
the X. nematophilus A24 Toxin Gene 

DNA isolated from a range of Xenorhabdus species and 

20 Photohabdus (bacteria symbiotically associated with 

nematodes from the family Heterohabditidae) controls was 
digested to completion with the restriction enzyme Eco RV 
and run out on a 0.8% TAE agarose gel and the DNA 
fragments blotted and fixed onto a Hybond-N+ membrane 

25 (Amersham) as per the manufacturer's instructions. 

The toxin gene was radiolabelled with 32p using nick 
translation (Maniatis et al. f 1982) and probed against the 
blot containing the DNA of a range of Xenorhabdus and 
Photohabdus strains (Maniatis et al., 1982). Under 

30 moderate stringency wash conditions at 65°C(0.1% SDS, 1% 
SSPE, Maniatis et al, 1982) the toxin only hybridised to 
X. nematophilus and X. beddingii stra ps. However, the 
toxin gene did not show any homology to the DNA from 
strains of X. bovienli, X. poinaril, some unclassified 

35 Xenorhabdus spp. and Photohabdus luminescens. This result 
suggests that this toxin type is confined to strains from 
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the species X. nematophllus and X. beddlngll. As X. 
beddingli has insecticidal activity and shows homology to 
the toxin gene it is most probable that these sequences 
are part of related/similar yet slightly different toxins. 
5 A high stringency wash at 65°C(0.1% SDS, 0.1% SSPE; 

Maniatis et al. 1982) of the blot removed the message from 
the X.beddingii strain, but not from the X . nematophllus 
strains . 

Characteristics of the Toxic Protein Product 
10 The toxin is inactivated by heating to 65°C for 15 

minutes, yet stable at 45°C. Sodium dodecyl sulphate at a 
concentration of 0.1% does not inactivate this toxin 
thereby indicating extreme stability and thereby a protein 
which will fold into its appropriate form under a wide 
15 range of different conditions (which includes most cell 
types ) . 

This new class of toxin may be purified by one or 
more methods of protein purification well known in the 
art. Insecticidal fragments may be generated from the 

20 purified toxin using, for example, cleavage with trypsin 
or cyanogen bromide. 

As will be appreciated by those skilled in this 
field, the present invention provides a new class of 
toxins useful for genetically engineering a wide range of 

25 biological systems which will thus become more useful for 
control of insect pests detrimental to agricultural, 
aquatic and forest industries . 

It will be appreciated by persons skilled in the art 
that numerous variations and/or modifications may be made 

30 to the invention as shown in the specific embodiments 
without departing from the spirit or scope of the 
invention as broadly described. The present embodiments 
are, therefore, to be considered in all respects as 
illustrative and not restrictive. 
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TABLE 1 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 



AAGAAACCGT 
TTCAATAAAC 



ACAACTCCGA 
IGA3AAAGGT 
IACGTGTACT 
3GAATTCCAT 
AAGTGGTATT 
AGGATAACAC 
GCACTATTGG 
TTTTACCCAG 
GGATATTCCT 
GATAAATTAT 
GGCAAATCAA 
rACTTATCTT 
AATTTTTACA 
AAAAAAACTT 



AACAGCGGAA ATCAACGCTG CAATTTATAT 

S tart 

GCCA ACATAA TGGGAAAGT A CffiTGGTTAT 
-10 S-D^X_Z 



TAAACCCGTB 
TAACGCCTG? 



6T&AATACA AIT^toCCT GATGATAGAG 
GAATATCAAC CCGTTGAAAA GCAAATAGCG GGAGATAXAAJ 



AGAATTCAAG CAAACAAATG AAAGTCATAC 
ATCGAGCTAA GAAAGTAATA ATAGCATATG 
CATAATGTCT CTCAACTTCC AGAAGACTAT 
AGGTAGAATT TATCAAGTAT ACATGTCTAA 
GTGAGAATGG TGATCAAATT TCTAAAGATA 
AACGAACTGG AGTTTGAGGT CAACGTCTTA 
G AT CTT GA GA ATAAACTATT GGAAGATTTA 
TAGCACTATA TTTCTTTGCT TCACAAGAAC 
CAATCAAATG CAGCAAATTT TTTTAAAGTA 
ATCT GC TGTA ACATCACTGG GAAAAAGGAT 
ATGGTCTAGA AACTAAATCA TTAGAGAATT 
TCTAAACCTT TCTTTC G ACC ACCGCAGAAG 
CTACTTGGCC GGTCCAACAA AAGCGCCTAA 
CAGCAACAAC GTCTACAGCA GCTTCATCTA 



GCAGAACAGG 
ACGTCTTCTA 

T&.f7 TTTsefeA aaacciRa<3 a gataacccat ccagaaatac 



AGGATTGTA3 
CTTTAGCGGI 
TATAAAAATA 
TCTTTTATCT 
TGGCAAATGA 
AAAATACCTG 
TTCAGATGAA 
TTCCAATGGA 
ATTGATTTTT 
TTTTTCAAAA 
ATATTGAGAG 
TTACCTGATG 
ATTGCCAACA 
ATTGGAGAGT 



Stop 

ATGGATGATG CTGCAAAACG AAAATATAGT TCATTTATAA 
AAAGGGTAAT GATCCACGTG CAGCAGCAGC AAGTATTGGT 
GCAGTAACTT CGAAAAACTG CAAGGTAGAG ATTTAIAIAG 
AGCCAAGAAC ACAGGGTAAC ATTCT CCA TA AATAATACTG 
GGAGATCCAA AGTGTTGGAA CTCATTACCA AAAXATATAA 
AGTAGTGATA AGACGTAAGA TAAATATGGA AGGTTGTAAT 
TTCCTC A GAG GTGACCOCTC AG 



ATTTATGAAA 
AAGAGGTACA 
ACAAAAAGCG 
TATAAGACTA 
ACCAAATAAT 
CCTGATTTAT 
TCTATTGCAC 
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TABLE 2 



i (mvikpvttps viqltpddrv tpddkgeyqp vekqiagdii rvlefkqtne 

51 SHTGLYGIFY PAKK7IIAYA LAVSGIHNVS QLPEDYYKNK DNTGRIYQVY 
101 MSNLLSALLG ENGDQISKDM ANDFTQNELE FEVNVLKIPG IFLILRINYW 
KIYSDEDKLL ALYFFASQEL PMEANQQSNA ANFFKVTDFL LILSAVTSLG 
KRIFSKNFVN GICTKSLENY IERKKLSK FF FRPPQKLPDG RTGYIAGPTK 



151 
201 



251 Iapklpotsst K TTSEl&Sm wrvsix)k4r *PIQK^IYEN g*ccktki*f 



301 IYKRGTKG** STCSSSKYWY KKRQ*LRKTA R*RFI*YKTK PRTQGNILHK 
351 *Y*ENNGDFK CWNSLPKYIT *fiwirrki nmegcnsial PQR*PL 
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CLAIMS: 

1. A polynucleotide molecule comprising a nucleotide 
sequence which encodes an insecticidal toxin and which is 
at least 70% homologous to the nucleotide sequence shown 

5 in Table 1 from residue 83 to 919, or a fragment thereof 
which fragment encodes an insecticidal polypeptide. 

2. A polynucleotide molecule as claimed in claim 1 in 
which the nucleotide sequence is at least 90% homologous 
to the nucleotide sequence shown in Table 1 from residue 

10 83 to 919. 

3. A polynucleotide molecule comprising a nucleotide 
sequence substantially corresponding to the sequence shown 
in Table 1 from residue 83 to 919 or a fragment thereof, 
which fragment encodes an insecticidal polypeptide. 

15 4. A polynucleotide molecule according to claim 7, 
wherein the nucleotide sequence encodes an insecticidal 
toxin, or an insecticidal fragment thereof, from 
Xenorhabdus nematophilus 

5. A polynucleotide nucleotide molecule according to 
20 any one of claims 1 to 4 in which the molecule is a DNA 

molecule • 

6. A purified insecticidal toxin, or functional 
fragment thereof, from the bacterial genus Xenorhabdus. 

7. A purified insecticidal toxin, or functional 
25 fragment thereof, from Xenorhabdus nematophilus. 

8. An insecticidal toxin which includes an amino acid 
sequence which is at least 70% homologous to residues 1 to 
278 shown in Table 2 or a functional fragment thereof. 

9. An insecticidal toxin as claimed in claim 8 in which 
30 the toxin includes an amino acid sequence which is at 

least 90% homologous to residues 1 to 278 shown in Table 2 
or a functional fragment thereof. 

10. An insecticidal toxin, the toxin including an amino 
acid sequence substantially corresponding to residues 1 to 

35 278 shown in Table 1 or a functional fragment thereof. 
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11. A recombinant organism characterised in that it is 
transformed with the polynucleotide molecule according to 
any one of claims 1 to 5. 

12. A recombinant organism according to claim 10 

5 selected from the group consisting of entomopoxvirus , 

nuclear polyhedrosis virus, bacteria, fungi, protozoa and 
plants . 

13. A method for controlling the proliferation of 
insects, comprising applying to an infested area a 

10 recombinant organism according to claim 10 or 11 

optionally in admixture with an acceptable agricultural 
carrier . 
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AMENDED CLAIMS 

[received by the International Bureau bn 25 November 1994 (25.11.94); 
original claims' 6 and 7 amended; remaining claims unchanged (1 page)] 

1. A polynucleotide molecule comprising a nucleotide 
sequence which encodes an insecticidal toxin and which is 
at least 70% homologous to the nucleotide sequence shown 

5 in Table 1 from residue 83 to 919, or a fragment thereof 
which fragment encodes an insecticidal polypeptide. 

2. A polynucleotide molecule as claimed in claim 1 in 
which the nucleotide sequence is at least 90% homologous 
to the nucleotide sequence shown in Table 1 from residue 

10 83 to 919. 

3. A polynucleotide molecule comprising a nucleotide 
sequence substantially corresponding to the sequence shown 
in Table 1 from residue 83 to 919 or a fragment thereof, 
which fragment encodes an insecticidal polypeptide. 

15 4. A polynucleotide molecule according to claim 7, 
wherein the nucleotide sequence encodes an insecticidal 
toxin, or an insecticidal fragment thereof, from 
Xenorhabdus nematophilus 

5 . A polynucleotide nucleotide molecule according to 
20 any one of claims 1 to 4 in which the molecule is a DNA 

molecule . 

6. A purified insecticidal protein, or functional 
fragment thereof, from the bacterial genus Xenorhabdus. 

7. A purified insecticidal protein, or functional 
25 fragment thereof, from Xenorhabdus nematophilus. 

8. An insecticidal toxin which includes an amino acid 
sequence which is at least 70% homologous to residues 1 to 
278 shown in Table 2 or a functional fragment thereof. 

9. An insecticidal toxin as claimed in claim 8 in which 
30 the toxin includes an amino acid sequence which is at 

least 90% homologous to residues 1 to 278 shown in Table 2 
or a functional fragment thereof. 

10. An insecticidal toxin, the toxin including an amino 
acid sequence substantially corresponding to residues 1 to 

35 278 shown in Table 1 or a functional fragment thereof. 
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PESTICIDAL AGENTS 

The present invention relates to materials, agents and 
5 compositions having pesticidal activity which derive from 
bacteria, and more particularly from Xenorhabdus species. 
The invention further relates to organisms and methods 
employing such compounds and compositions. 

10 There is an ongoing requirement for materials, agents, 
compositions and organisms having pesticidal activity, 
for instance for use in crop protection or insect- 
mediated disease control. Novel materials are required 
to overcome the problem of resistence to existing 

15 pesticides. Ideally such materials are cheap to produce, 
stable, have a high toxicity (either when used alone or 
in combination) and are effective when taken orally by 
the pest target. Thus any invention which provided 
materials, agents, compositions or organisms in which any 

20 of these properties was enhanced would represent a step 
forward in the art. 

Xenorhabdus spp. in nature are frequently symbiotically 
associated with a nematode host, and it is known that 
25 this association may be used to control pest activity. 
For instance, it is known that certain Xenorhabdus spp. 
alone are capable of killing an insect host when injected 
into the host's hemocoel. 

30 In addition, one extracellular insecticidal toxin from 
Photorhabdus luminescens has been isolated (this species 
was recently removed from the genus Xenorhabdus , and is 
closely related to the species therein). This toxin* is 
not effective when ingested, but is highly toxic when 

35 injected into certain insect larvae (see Parasites and 
Pathogens of Insects Vol. 2, Eds. Beckage, N. E . et 
al., Academic Press 1993). 
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Also known are certain low-molecular weight heterocyclic 
compounds from P . luminescens and X.nematophilus which 
have antibiotic properties when applied intravenously or 
topically (see Rhodes, S.H. et al., PCT WO 84/01775)., 

Unfortunately none of these prior art materials have the 
ideal pesticide characteristics discussed above, and in 
particular, they do not have toxic activity when 
administered orally. 

The present invention provides pesticidal agents and 
compositions from Xenorhabdus species, organisms which 
produce such compounds and compositions, and methods 
which employ these agents, compositions and organisms, 
that alleviate some of the problems with the prior art. 

According to one aspect of the present invention there is 
disclosed a method of killing or controlling insect pests 
comprising administering cells from Xenorhabdus species 
20 or pesticidal materials derived or obtainable therefrom, 
orally to the pests. 

A PCT application of CSIRO published as WO 95/00647 
discloses an apparently toxic protein from Xenorhabdus 
25 nematophilus; however no details of the protein's 

toxicity are given, and certainly there is no disclosure 
of its use as an oral insecticide. 

Thus the invention provides an insecticidal composition 
30 adapted for oral administration to an insect, which 

composition comprises a pesticidal material obtainable 
from a Xenorhabdus species, or a pesticidal fragment 
thereof, or a pesticidal variant or derivative of either 
of these. 

35 

The composition may in fact comprise cells of Xenorhabdus 
or alternatively supernatant taken from cultures of cellB 
of Xenorhabdus species. However, the composition 
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preferably comprises toxins isolable from Xenorhabdus as 
illustrated hereinafter. Toxic activity has been 
associated with material encoded by the nucleotide 
sequence of Figure 2. Thus, the composition suitably 
5 comprises a pesticidal material which is encoded by all 
or part of the nucleotide sequence of Figure 2. 
Pesticidal fragments as well as variants or derivatives 
of such toxins may also be employed. 

10 The sequence of Figure 2 is of the order of 40kb in 

length. It is believed that this sequence may encode more 
than one protein, each of which may regulate or be 
insecticidal either alone or when presented together. It 
is a matter of routine to determine which parts are 

15 necessary or sufficient for insecticidal activity. 

As used herein the term ~ variant" refers to toxins which 
have modified amino acid sequence but which share similar 
activity. Certain amino acids may be replaced with 

20 different amino acids without altering the nature of the 
activity in a significant way. The replacement may be 
by way of ^conservative substitution • ' where an amino 
acid is replaced with an amino acid of broadly similar 
properties, or there may be some non-conservative 

25 substitutions. In general however, the variants will be 
at least 60% homologous to the native toxin, suitably at 
least 70% homologous and more preferably at least 90% 
homologous . 

30 The term ^ derivative ' 1 relates to toxins which have been 
modified for example by chemical or biological methods. 

These toxins are novel, and they and the nucleic acids 
which encode them form a further aspect of the invention. 

35 

A preferred Xenorhabdus species is the bacteria 
X-nematophilus. Particular strains of X.nematophilus 
which are useful in the context of the invention are 
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ATTC 19061 strain, available from the National Collection 
of Industrial and Marine Bacteria, Aberdeen, Scotland 
(NCIMB) . In addition, suitable strains include two novel 
strains of Xenorhabdus which were deposited at the NCIMB 
5 on 10 July 1997 and were designated with repository 
numbers NCIMB 40886 and NCIMB 40887, TheBe latter 
strains form a further aspect of the invention. 

All strains have common characteristics as set out in the 
10 following Table 1. 



Table 1 



Strains 



Characteristics ATCC 19061 NCIMB 40887 NCIMB 40B86 



Gram strain 
Shape/size 

Motile 

Bio luminescent 
Colour on NBTA* 
insecticidal on 
ingestion by 
insects 
Production of 
Antibiotics 
Resistant to 
ampicillin 

(50ng/ml) 
colony 
morphology/ 
colour 



Negative 
rods up to 
4^m long 
Yes 
No 

blue 

yes 

yes 

yes 

circular 

convex 

cream 



negative 
rods up to 
4jim long 
Yes 
No 

blue 

yes 

yes 

yes 

circular 

convex 

cream 



negative 
rods up to 

4fim long 

Yes 

No 

blue 

yes 

yes 

yes 

circular 

convex 

cream 



15 *NBTA (Oxoid nutrient agar containing 0.0025% 
bromothymol blue and 0.004% tetrazolium chloride) 

Preferably the pest target is an insect, and more 
preferably it is of the order Lepidoptera, particularly 
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Pieris brassicae, Pieris rapae, or Plutella xylostella or 
the order Diptera, particularly Culex qulnquefaciatus . 

In a preferred embodiment of the invention, cells from 
5 Xenorhabdus species or agents derived therefrom are used 
in conjunction with Bacillus thuringiensis as an oral 
pesticide. 

In further embodiments, rather than using Bacillus 
10 thuringiensis itself, pesticidal materials obtainable 
from B. thuringiensis (e.g. delta endotoxins or other 
isolates) are used in conjunction with Xenorhabdus 
species . 

15 The term 'obtainable from' is intended to embrace not 

only materials which have been isolated directly from the 
bacterium in question, but also those which have been 
subsequently cloned into and produced by other organisms. 

20 Thus the unexpected discovery that bacteria of the genus 
Xenorhabdus (and materials derived therefrom) have 
pesticidal activity when ingested, and that such bacteria 
and materials can be used advantageously in conjunction 
with B. thuringiensis (and toxins or materials derived 

25 therefrom), forms the basis of a further aspect of the 
present invention. The pesticidal activity of 
fl. thuringiensis isolates alone have been well documented. 
However, synergistic pesticidal activity between such 
isolates and bacteria of the Xenorhabdus species (or 

30 materials derived therefrom) has not previously been 
demonstrated. 

In still further embodiments of the invention, culture 
supernatant taken from cultures of Xenorhabdus species, 
35 particularly X. nematophilus , is used in place of cells 
from Xenorhabdus species in the methods above. 
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All of these methods can be employed, inter qliftr in peBt 
control. 

The invention also makes available pesticidal 
5 compositions comprising cells from Xenorhabdus species, 
preferably X.nematophilus , in combination with B. 
thurlngiensls. As with the methods above, a pesticidal 
toxin from B.thuringiensis (preferably a delta endotoxin) 
may be used as an alternative to B.thuringiensis in the 
10 compositions of the present invention 

Likewise, culture supernatant taken from cultures of 
Xenorhabdus species, preferably, X.nematophilus may be 
used in place of cells from Xenorhabdus species. 

15 

Such compositions can be employed, inter alia, for crop 
protection eg. by spraying crops, or for livestock 
protection. In addition, compositions of the invention 
may be used in vector control . 

20 

The invention further encompasses novel pesticidal agents 
which can be isolated from Xenorhabdus spp. Techniques 
for isolating such agents would be understood by the 
skilled person. 

25 

In particular, such techniques include the separation and 
identification of toxin proteins either at the protein 
level or at the DNA level. 

30 The applicants have cloned and partially sequenced a 

region of DNA from Xenorhabdus NCIMB 40887 which region 
codes for insecticidal activity and this is shown as 
Figure 2 ( SEQ ID NO. 1) hereinafter. Thus in a preferred 
embodiment the invention alBO provides a toxin which is 

35 encoded by DNA of SEQ ID No. 1 or a variant or fragment 
thereof . 
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The invention also provides a recombinant DNA which 
encodes such a toxin. The recombinant DNA of the 
invention may comprise the sequence of Figure 2 or a 
variant or fragment thereof. Other DNA sequences may 
5 encode similar proteins as a result of the degeneracy of 
the genetic code. All such sequences are encompassed by 
the invention. 

The sequence provided herein is sufficient to allow 
10 probes to be produced which can be used to identify and 
subsequently to extract DNA of toxin genes. This DNA may 
then be cloned into vectors and host cells as is 
understood in the art. 

15 DNA which comprises or hybridises with the sequence of 
Figure 2 under stringent conditions forms a further 
aspect of the invention. 

The expression ^hybridises with" means that the 
20 nucleotide sequence will anneal to all or part of the 
sequence of Figure 2 under stringent hybridisation 
conditions, for example those illustrated in - Molecular 
Cloning' 1 , A Laboratory Manual" by Sambrook, Fritsch and 
Maniatis, Cold Spring Habor Laboratory Press, Cold Spring 
25 Harbor, N.Y. 

The length of the sequence used in any particular 
analytical technique will depend upon the nature of the 
technique, the degree of complementarity of the sequence, 

30 the nature of the sequence and particularly the GC 
content of the probe or primer and the particular 
hybridisation conditions employed. Under high 
stringency, only sequences which are completely 
complementary will bind but under low stringency 

35 conditions, sequences which are 60% homologous to the 

target sequence, more suitably 80% homologous, will bind. 
Both high and low stringency conditions are encompassed 
by the term ^stringent conditions' 1 used herein. 



SUBSTITUTE SHEET (RULE 26) 



WO 98/08388 



PCT/GB97/02284 



8 

Suitable fragments of the DNA of Figure 2, i.e. those 
which encode pesticidal agents may be identified using 
standard techniques. For example, traneposon 

5 mutagenesis techniques may be used, for example as 

described by H.S. Siefert et al., Proc. Natl. Acad. Sci. 
USA, (1986) 83, 735-739. Vectors such as the cosmid 
cHRIMl, can be mutated using a variety of transposons and 
then screened for loss of insectidal activity. In this 

10 way regions of DNA encoding proteins responsible for 
toxic activity can be identified. 

For example, the mini-transposon mTn3(HIS3) can be 
introduced into a toxic Xenorhabdus clone such as cHRIMl, 

15 hereinafter referred to as * clone 1 1 , by electroporating 
cHRIMl DNA into E.coli RDP146 ( pLBlOl ) and mating this 
strain with E.coli RDP146 (pOX38 ) , followed by E . coli 
NS2114Sm. The final strain will contain cHRIMlDNA with a 
single insertion of the transposon mTn3(HIS3). These 

20 colonies can be cultured and tested for insecticidal 
activity as described in Example B hereinafter. 
Restriction mapping or DNA sequencing can be used to 
identify the insertion point of mTn3(HIS3) and hence the 
regions of DNA involved in toxicity. Similar approached 

25 can be used with other transposons such as Tn5 and mTn£. 

Site directed mutagenesis of cHRIMl as outlined in 
^Molecular Cloning, A Laboratory Manual" by Maniatis, 
Fritsch and Sambrook, (19B2) Cold Spring Harbor, can also 
30 be used to test the importance of specific regions of DNA 
for toxic activity. 

Alternatively, subcloning techniques can be used to 
identify regions of the cloned DNA which code for 
35 insecticidal activity. In this method, specific smaller 
fragments of the DNA are subcloned and the activity 
determined. To do this, cosmid DNA can be cut with a 
suitable restriction enzyme and ligated into a compatible 
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restriction site on a plasmid vector, such as pUC19. 
The ligation mix can be transformed into E. coli and 
transformed clones selected using a selection marker such 
as antibiotic resistance, which is coded for on the 
5 plasmid vector. Details of these techniques are 

described for example in Maniatis et al f supra, (see 
p390-391) and Methods in Molecular Biology, by L.G . 
Davies, M.D. Dibner and J.F. Battey, Elsevier, (see p222- 
224). 

10 

Individual colonies containing specific cloned fragments 
can be cultured and tested for activity as described in 
Example 8 hereinafter. Subclones with insecticidal 
activity can be further truncated using the same 
15 methodology to further identify regions of the DNA coding 
for activity. 

The invention also discloses an isolated pesticidal agent 
characterised in that the agent is obtainable from 

20 cultures of X. nematophilus or variants thereof, has oral 
pesticidal activity against Pieris brassicae, Pieris 
rapae and Plutella xylostella, is substantially heat 
stable to 55°C / is proteinaceous , acts synergistically 
with B. thuringiensis cells as an oral pesticide and is 

25 substantially resistant to proteolysis by trypsin and 
proteinase K. 

By 'substantially heat stable to 55°C I is meant that the 
agent retains some pesticidal activity when tested after 
30 heating the agent in suspension to 55°C for 10 minutes, 
and preferably retains at least 50% of the untreated 
activity. 

By 'substantially resistant to proteolysis' is meant that 
35 the agent retains some pesticidal activity when exposed 
to proteases at 30°C for 2 hours and preferably retains 
at least 50% of the untreated activity. 
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By 'acts synergistically 1 is meant that the activity of 
the combination of components is greater than one might 
expect from the use of the components individually. For 
example, when used in conjunction with B.thuringiensis 

5 cells as an oral pesticide, the concentration of B . 
thuringiensis cellular material necessary to give 50% 
mortality in a P.brassicae when used alone is reduced by 
at least 80% when it is used in combination the agent at 
a concentration sufficient to give 25% mortality when the 

10 agent is used alone* 

It has been found that the activity of the material is 
retained by 30 kDa cut-off filters but is only partly 
retained by 100 kDa filters. 

15 

Preferably the agent is still further characterised in 
that the pesticidal activity is lost through treatment at 
25 °C with sodium dodecyl sulphate (SDS - 0.1% 60 mins) 
and acetone (50%, 60 mins). 

20 

Clearly the characterising properties of the isolated 
agent described above can be utilised to purify it from, 
or enrich its concentration in, Xenorhabdus species cells 
and culture medium supernatants . Methods of purifying 

25 proteins from heterogenous mixtures are well known in the 
art (eg. ammonium sulphate precipitation, proteolysis, 
ultrafiltration with known molecular weight cut-off 
filters, ion-exchange chromatography, gel filtration, 
etc.). The oral pesticidal activity provides a 

30 convenient method of assaying the level of agent after 
each stage, or in each sample of eluent. Such 
methodology does not require inventive endeavour by thoBe 
skilled in the art. 

35 The invention further discloses oral pesticidal 

compositions comprising one or more agents as described 
above. Such compositions preferably further comprise 
other pesticidal materials from non-Xenorhabdus species. 
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These other materials may be chosen such as to have 
complementary properties to the agents described above , 
or act synergistically with it. 

5 Preferably the oral pesticidal composition comprises one 
or more pesticidal agents as described above in 
combination with B. thuringiensis (or with a toxin 
derived therefrom, preferably endotoxin). 

10 Recombinant DNA encoding said proteins also forms a 
further aspect of the invention . The DNA may be 
incorporated into an expression vector under the 
influence of suitable control elements such as promoters, 
enhancers, signal sequences etc. as is understood in the 

15 art. These expression vectors form a further aspect of 
the invention. They may be used to transform a host 
organism so as to ensure that the organism produces the 
toxin. 

20 The invention further makes available a hoBt organism 
comprising a nucleotide sequence coding for a pesticial 
agent as described above. 

Methods of cloning the sequence for a characterised 
25 protein into a host organism are well known in the art. 
For instance the protein may be purified and sequenced: 
as activity is not required for sequencing, SDS gel 
electrophoresis followed by blotting of the gel may be 
used to purify the protein. The protein sequence can be 
30 used to generate a nucleotide probe which can itself be 
used to identify suitable genomic fragments from a 
Xenorhabdus gene library. These fragments can then be 
inserted via a suitable vector into a host organism which 
can express the protein. The use of such general 
35 methodology is routine and non-inventive to those skilled 
in the art. Such techniques may be applied to the 
production of Xenorhabdus toxins other than those encoded 
by the sequence of Figure 2. 
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It may be desirable to manipulate (eg. mutate) the agent 
by altering its gene sequence (and hence protein 
structure) such as to optimise its physical or 
5 toxicological properties. 

It may also be desirable for the host to be engineered or 
selected such that it also expresses other proteinaceous 
pesticidal materials (eg. delta- endotoxin from B. 
10 thuringiensis) . Equally it may be desirable to generate 
host organisms which express fusion proteins composed of 
the active portion of the agent plus these other toxicity 
enhancing materials. 

15 A host may be selected for the purposes of generating 

large quantities of pesticidal materials for purification 
e.g. by using B. thuringiensis transformed with the agent- 
coding gene. Preferably however the host is a plant, 
which would thereby gain improved pest-resistance. 

20 Suitable plant vectors, eg. the Ti plasmid from 

Agrobacterium tumefaciens, are well known in the art. 
Alternatively the host may be selected such as to be 
directly pathogenic to pests, eg. an insect baculovirus. 

25 The teaching and scope of the present invention embraces 
all of these host organisms plus the agents, mutated 
agents or agent-fusion materials which they express. 

Thus the invention makes available methods, compositions, 
30 agents and organisms having industrially applicable 
pesticidal activity, being particularly suited to 
improved crop protection or insect-mediated disease 
control . 

35 The methods, compositions and agents of the present 

invention will now be described, by way of illustration 
only, through reference to the following non-limiting 
examples and figures. Other embodiments falling within 
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the scope of the invention will occur to those skilled in 
the art in the light of these. 

FIGURE 

5 Figure 1 shows the variation with time of the growth of 
X. nematophllus ATCC 19061 and activity of cells and 
supernatants against P. brassicae as described in Example 
3. 

10 Figure 2 shows the sequence of a major part of a cloned 
toxin gene from Xenorhabdus. 

Figure 3 shows a comparison of the restriction maps of 
cloned toxin genes from two strains of Xenorhabdus 
15 (clone 1 above and clone 3 below). 



EXAMPLES 

20 

Example 1 - Use of X. nematophilus cells as an oral 
insecticide 

CELL GROWTH : A subculture of X. nematophilus (ATCC 19061, 
25 Strain 9965 available from the National Collections of 
Industrial and Marine Bacteria, Aberdeen, Scotland) was 
used to inoculate 250 ml Erlenmeyer flasks each 
containing 50 ml of Luria Broth containing lOg tryptone, 
5g yeast extract and 5g NaCl per litre. Cultures were 
30 grown in the flasks at 27°C for 40hrs on a rotary shaker. 

PRODUCTION OF CELL SUSPENSION: Cultures were centrifuged 
at 5000 x fl for 10 mins. The supernatants were discarded 
and the cell pellets washed once and resuspended in an 
35 equal volume of phosphate buffered saline (8g NaCl, 1.44g 
Na 2 HP0 4 and 0.24g of KH 2 P0 4 per litre) at pH 7.4. 
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ACTIVITY OF CELL SUSPENSION TO INSECTS: The bioassays 
were as follows: P. brassicae: The larvae were allowed 
to feed on an artificial agar-based diet (as described by 
David and Gardiner (1965) London Nature, 207, 882-883) 

5 into which a series of dilutions of cell suspension had 
been incorporated. The bioassays were performed UBing a 
series of 5 doseB with a minimum of 2 5 larvae per dose. 
Untreated and heat-treated (55°C for 10 minutes) cells 
were tested. Mortality was recorded after 2 and 4 days 

10 with the temperature maintained at 25 °C. 

LC50 cells/g diet 

Treatment 2 day? 4 tiayp 

Untreated 5.9 x 10 5 9.8 x 10 4 

15 Treated 55°C 7,1 x 10 5 1.4 x 10 5 



Aedes aegypti: The larva were exposed to a series of 5 
different dilutions of cell suspension in deionised 
water. The biosassays were performed using 2 doses per 
20 dilution of 50 ml cell suspension in 9.5cm plastic cups 
with 25 second instar larvae per dose. Untreated and 
heat-treated (55°C or 80°C for 10 minutes) cells were 
tested. Mortality was recorded after 2 days with the 
temperature maintained at 25°C. 

LC50 cells/ml 
2 davs 
5.1 x 10 6 
7.4 x 10 6 



Treatment 

Untreated 
Treated 55 °C 
30 Treated 80 °C 



Culex quinquefaciatus : The larvae were exposed to a 

^ 7 

single concentration cell suspension containing 4 xlO 
cells/ml. The biosassays were performed using 2 50 ml 
35 cell suspensions in 9.5 cm plastic cups with 25 second 
instar larvae per cup. Untreated and heat-treated (55°C 
or 80°C for 10 minutes) cells were tested. Mortality was 
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recorded after 2 days with the temperature maintained at 
25°C. 



% Mortality 
Treatment 2 days 

Untreated 10° 
Treated 55°C 100 
Treated 80°C 0 



10 Thus these results clearly show that cells from X. 

nematophllus are effective as an oral insecticide against 
a number of insect species (and are particularly potent 
against P. brassicae) . The insecticidal activity is not 
dependent on cell viability (i.e is largely unaffected by 

15 heating to 55 °C which reduces cell viability by >99.99%) 
but is much reduced by heating to 80 °C, which denatures 
most proteins. 

Example 2 - Use of X.nematophilus supernatant as an oral 
20 insecticide 

CELL GROWTH: Cultures were grown as in Example 1. 

PRODUCTION OF SUPERNATANT: Cultures were centrifuged 
25 twice at lOOOOg for 10 mins. The cell pellets were 
discarded. 

ACTIVITY OF SUPERNATANT TO INSECTS: The Bioassay was as 
follows : 

30 Activity against neonate P. brassicae and two day old 

Pieris rapae and Plutella xylostella larvae was measured 
as for P. brassicae in Example 1, but using a series of 
untreated dilutions of supernatant in place of of cell 
supensions and with mortality being recorded after 4 days 

35 only. 
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Insect species 
P. brassicae 
5 P. rapae 

P. xylostella 

In addition, size-reducing activity (62% reduction in 7 
days) againBt Mamestra brassicae was detected in larvae 
10 fed on an artificial diet containing X. nematophilus 
supernatant (results not shown). 

Thus these results clearly show that the supernatant from 
X. nematophilus culture medium is effective as an oral 
15 insecticide against a number of insect species, and are 
particularly potent against P. brassicae . 

The heating of supernatants to 55°C for 10 minutes caused 
a partial loss of activity while 80°C caused complete 

20 los6 of activity. Activity was also completely lost by 
treatment with SDS (0.1%w/v for 60 mins) and Acetone (50% 
v/v for 60 mins) but was unaffected by Triton X-100 (0.1% 
60 mins), non-diet P40 (0.1% 60 mins), NaCl ( 1 M f or 60 
mins) or cold storage at 4°C or -20°C for 2 weeks. All 

25 of these properties are consistent with a proteinaceous 
agent . 

The general mode of action of X. nematophilus cells and 
supernatants i.e. reduction in larval size and death 
30 within 2 days at high dosages, and other properties, eg. 
temperature resistence, appear to be similar suggesting a 
single agent or type of agent may be responsible for the 
oral insecticide activity activities of both cells and 
supernatants . 

35 

Example 3 - Timescale for appearance of ingestable 
insecticidal activity 



LC50 (jil supernatant /g diet) 
4 days 
22 
79 
135 
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CELL GROWTH: 1ml of an overnight culture of X. 
nematophllus was used to inoculate an Erlenmeyer flask. 
Cells were then cultured as in Example 1. Growth was 
estimated by measuring the optical density at 600 nm. 

5 

PRODUCTION OF CELL SUSPENSION AND SUPERNATANTS : These 
were produced as in Examples 1 and 2 . 

ACTIVITY OF CELLS AND SUPERNATANTS AGAINST P. BRASSICAE: 
10 The cell suspension bioassay was carried out as in 

Example 1, but using a single dose of suspended cells 
equivalent to 50 pi of broth/g diet and measuring 
mortality after 2 days. The cell supernatant bioassay 
was carried out as in Example 2, but using a single dose 
15 equivalent to 50 \il supernatant/g diet (i.e. more than 
twice the LC50) and measuring mortality after 2 days. 

The results are shown in Fig. 1. Thus these results 
clearly show that cells taken from X. nematophllus 

20 culture medium are highly effective as an oral 

insecticide against P. brassicae after only 5 hours, and 
supernatants are highly effective after 20 hours. 
Although some slight cell lysis was observed in the early 
stages of growth, no significant cell lysis was observed 

25 after this point demonstrating that the supernatant 

activity may be due to an authentic extracellular agent 
(as opposed to one released only after cell breakdown). 

Example 4 - Synergy between X. nematophllus cells and 
30 B. thuringiensis powder preparations 

CELL GROWTH AND SUSPENSION: X. nematophllus cells were 
grown and suspended as in Example 1. B. thurlngiensis 
strain HD1 (from Bacillus Genetic Stock Centre, The Ohio 
35 State University, Columbus, Ohio 43210, USA) was 

cultured, harvested and formulated into a powder as 
described by Dulmage et al. (1970) J. Invertebrate 
Pathology 15, 15-20. 
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ACTIVITY OF X. NEMA TOPHI L US CELLS AND B. THURINGIENSIS 
POWDER AGAINST P. BRASSICAE : The bioassays was carried 
out using X. nematophilus and B. thuringiensis in 

5 combination or using B. thuringiensis cell powder alone. 
Bioassays were carried out as in Example 1 but with 
various dilutions of B. thuringiensis powder in place of 
X. nematophilus. For the combination experiment, a 
constant dose of X. nematophilus cell suspension 

10 sufficient to give 25% mortaility was also added to the 
diet. Mortality waB recorded after 2 days. 

BiPflggay . 

15 B.t. alone 

B.t. plus X. nematophilus 

These results clearly demonstrate the synergism between 
X. nematophilus cells and B. thuringiensis powder when 
20 acting as an oral insecticide against P. brassicae. 

Example 5 - Synergy between of X, nematophilus 
supernatants and B . thuringiensis powder 

25 CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X. 

nematophilus cells were grown and supernatants prepared 
as in Example 2. B. thuringiensis was grown and treated 
as in Example 4 . 

30 ACTIVITY OF X. NEMATOPHILUS SUPERNATANTS AND Bt CELL 
POWDER AGAINST P. BRASSICAE: 

The bioassays were carried out using X. nematophilus 
supernatants and B. thuringiensis in combination or using 
B. thuringiensis powder alone. The Bioassay against 
35 neonate P. brassicae and two day old Pieris rapae and 

Plutella xylostella larvae were measured as in Example 2 
but with various dilutions of B. thuringiensis in place 
of X. nematophilus. For the combination experiment, a 
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constant dose of X. nematophilus supernatant sufficient 
to give 25% mortality was also added to the diet. 
Mortality was recorded after 4 days. 

5 LC 50 <yg Bt powder/g) 

diet 

a iasst secies Bt alone Pt p l us xn 

P. brassicae 1 - 4 0.12 

P. rapae 2.5 0.26 

10 P. xylostella 7.2 0.63 

These results clearly demonstrate the synergism between 
X. nematophilus supematants and B. thuringiensis powder 
when acting as an oral insecticide against several insect 
15 species. The fact that both X. nematophilus cells and 
supematants demonstrate this synergism strongly suggests 
that a single agent or type of agent is responsible for 
the demonstrated activities. 



20 



Example 5 - Characterisation of insect icidal agent from 
X. nematophilus supernatant by proteolysis 



CELL GROWTH AND PRODUCTION OF SUPERNATANTS : X. 
nematophilus cells were grown and supematants prepared 
25 as in Example 2. 

PROTEOLYSIS OF SUPERNATANT: Culture supernatant (50ml) 
was dialysed against 0.5 M NaCl (3 x 1 1) for 48 hours at 
4«C. The volume of the supernatant in the dialysis tube 
30 was reduced five-fold by covering with polyethylene 

glycol 8000 (Sigma chemicals). Samples were removed and 
treated with either trypsin (Sigma T8253 = 10,000 
units/mg) or proteinase K (Sigma P0390 = 10 units/rag) at 
a concentration of 0.1 mg protease/ml sample for 2 hours 

35 at 30°C. 

ACTIVITY OF PROTEASE TREATED SUPERNATANT AGAINST P. 
BRASSICAE: The boassay against neonate P. brassicae 
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20 

larvae was carried out by spreading 25 yl of each 
•treatment' on the artificial agar-based diet referred to 
in Example 1 in a 4 . 5 cm diameter plastic pot. Four pots 
each containing 10 larvae were used for each treatment. 
Mortalities were recorded after 1 and 2 days. Controls 
using water only, trypsin (0.1 mg/ml) and proteinase K 
(0.1 mg/ml) were also tested in the same way. 





% Mortality 




Treatment 


1 day 


2 days 


Untreated supernatant 


60 


100 


Proteinase K treated supernatant 


45 


100 


Trypsin treated supernatant 


40 


100 


All controls (no supernatant) 


0 


0 



15 



20 



Example 6 

Entomocida i activity o f other Jtenoriiafrdus 

Using the methodology of Examples 1 and 2, four different 

xenorhabdus strains were tested against insect pests. 

The results obtained were as follows: 

I) Activity to Pieris brassicae 



Strain deposit 
no/code 

NCIMB'**40887 

0014 

0015 

NCIMB 40886 



Cells loVgrm diet Supernatant LC50 

% mortality ^il/gram of diet 

100 0709 

100 0.52 
80 3.73 
100 0.05 



25 It was found that entomocidai activity of cells and 

supernatant was reduced by more than 99% when all four 
strains were heated at 80°C for 10 minutes. 
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II) Activity to mosquitoes (Aedes aegypti) 
Bacteria added at the rate of 10 7 cells/ml of water 



Strain deposit 


Cells 10*79™ diet 


no/code 


% mortality 


NCIMB 40887 


0 


0014 


40 


0015 


45 


NCIMB 40886 


95 



5 Furthermore, all strains significantly reduced the growth 
of Heliothis virescens . 

Example 7 

OfijUflg nf toxin rp^nes from strains of JfeflPfftflfrdHg 
10 Total cellular DNA was isolated from NCIMB 40887 and ATCC 
19061 using a Quiagen genomic purification DNA kit. 
Cells were grown in L borth (lOg tryptone, 5g yeast 
extract and 5g NaCl per 1) at 28°C with shaking (150rpm) 
to an optical density of 1.5 A 600 . Cultures were 
15 harvested by centrif ugation at 4000xg and resuspended in 
3.5mls of buffer Bl (50mM Tris/HCl, 0.05% Tween 20, 0.5% 
Triton X-100, pH7.0) and incubated for 30 mins at 50°C. 
DNA was isolated from bacterial lysates using Quiagen 
100/G tips as per manufacturers instructions. The 
20 resulting purified DNA was stored at -20°C in TE buffer 
(lOmM Tris, ImM EDTA, pH 8.0). 

A representative DNA library was produced using total DNA 
of NCIMB 40887 and ATTC 19061 partially digested with the 

25 restriction enzyme Sau3a. Approximately 20ng of DNA from 
each strain was incubated at 37°C with 0.25 units of the 
enzyme. At time intervals of 10 , 20, 30, 45 and 60 
minutes, samples were withdrawn and heated at 65°C for 15 
minutes. To visualise the size of the DNA fragments, the 

30 samples were electrophoresed on 0.5% w/v agarose gels. 
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The DNA samples which contained the highest proportion of 
30 to 50kb fragments were combined and treated with 4 
units of shrimp alkaline phosphatase (Boehringer) for 15 
minutes at 37°C, followed by heat treatment at 65°C to 
5 inactivate the phosphatase. 

The size selected DNA fragments were ligated into the 
BamHl site of the cosmid vector SuperCos! (Stratagent) 
and packaged into the Escherichia coli strain XL Blue 1, 
10 using a Gigapack II packaging kit (Stratgene) in 
accordance with the manufacturers instructions. 

To select for cosmid clones with entomocidal activity, 
individual colonies selected on L agar plates containing 

15 25ug/ml ampicillin, were grown in L broth (containing 
25Mg/ml ampicillin) overnight at 28°C. Broth cultures 
<50ul) were individually spread onto the surface of 
insect diet contained in 4.5cm diameter pots, as 
described in Example 5. To each container 10 neonate P. 

20 brassicae larvae were added. Larvae were examined after 
24, 72 and 96 hours recording mortality and size of 
surviving larvae. A total of 220 clones of NCIMB 40B87 
were tested, of which two were found to cause reduction 
in larval growth and death within 72 hours. Of 370 

25 clones from ATTC 19061, one was found to cause larval 
death within 72 hours. 



Example 8 

ft^ti vifV nf nlo n f H tovin nftnes to Pjeris brassiere 
30 The three active clones from Example 7 were grown in L 
broth, containing 25ug/ml ampicillin, for 24 hours at 
28°C, on a rotary shaker at 150rpm. The activity of the 
toxin clones to neonate larvae were performed by 
incorporation of whole broth cultures into insect diet, 
35 as described in Example 1 . 
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Clpn? ftp 



Strain 



T.C50 ful b roth /q insect diet) 



1 



NCIMB 40887 
NCIMB 40887 
ATTC 19061 



13,03 



2 



16.7 



3 



108.7 



Control* 



No effect at 100nl/g 



*XL1 Blue E. coli broth 



5 



When E. coli toxin clones were heated at 80°C for 10 
minutes and added to the diet at a rate of 100nl/g, no 
activity to larvae was detected. Highlighting the heat 
sensitivity of the toxins. 

10 

Example 9 

SSgasncin g of th e cloned toxin from NCIMB 4PP97 

Cosmid DNA of the entomocidal clone 1 above from NCIMB 

15 40887 was purified using the Wizard Plus SV DNA system 
(Promega) in accordance with the manufacturers 
instructions. A partial map of the cloned fragment was 
obtained using a range of restriction enzymes EcoRl, 
BamHl, Hindlll, Sail and Sacl as shown in Figure 3. DNA 

20 sequencing was intiatiated from pUC18 and pUC19 based 

sub-clones of the cosmid, using the enzymes EcoRl, BamHl, 
Hindlll, EcoRV and PvuII. Sequence gaps were filled 
using a primer walking approach on purified cosmid DNA. 
Sequence reactions were performed using the ABI PRISM™ 

25 Dye Terminator Cycle Sequencing Ready Reaction Kit with 
AmmpliTaq DNA polymerase FS according to the 
manufacturers instructions. The samples were analysed on 
an ABI automated sequencer according to the manufacturers 
instructions. The major part of the DNA sequence for the 

30 cloned toxin fragment is shown in Figure 2. 
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Example 10 

p ftff ^ rirt inn map nf rinned toxin from clone 3 
Cosmid DNA of the entomocidal clone 3 above was purified 
5 as described in Example 9. A restriction map of the 
cloned fragment was obtained using the restriction 
enzymes BamHl, HindHI, Sail and Sacl and this is shown 
in Figure 3. When compared with the map from clone 1 
(Figure 3) it is clear that over the regions which 
10 overlap, the restriction maps are very similar. The 
only detectable difference between the two clones was a 
reduction in size of two Hindlll fragments in clone 3, 
corresponding to the U.4kb and 7.2kb Hindlll fragments 
in clone 1 by approximately 2Kb and 200bp respectively. 
15 These results indicate the overall relatedness of the DNA 
region coding for toxicity in the two bacterial strains. 

Example 11 

c^i- h^n Riot Hyhridisation Experiments 

20 A 10.3kb BamHl-Sall fragment of the DNA from clone 1 was 
used as a probe to hybidise to total Hindi I I digested DNA 
of the Xenorhabdus strains ATCC 19061, NCIMB 40886 and 
NCIMB 40887. Hybridisation was performed with 20ng/ml of 
DIG labelled DNA probe at 65°C for 18 hours. Filters 
25 were washed prior to immunological detection twice for 5 
minutes with 2 x SSC (0.3M NaCl, 30mM sodium citrate, pH 
7.0)/0.1% (w/v) sodium dodecyl sulphate at room 
temperature, and twice for 15 minutes with 0.1 x SSC 
(15mM NaClm 1.5 mM sodium citrate, pH 7.0) plus 0.1% 
30 sodium dodecyl sulphate at 65°C. The probe was labelled 
and experiments performed in accordance with 
manufacturers instructions, using a non-radioactive DIG 
DNA labelling and detection kit (Boehringer) . The probe 
hybridised to a Hindi I I fragment of approximately 8kb in 
35 all three strains as well as an 11.4kb fragment in NCIMB 
40887 and an approximate 9kb fragment in both NCIMB 40886 
and ATCC 19061. These results show that strains NCIMB 
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40886 and &TCC 19061 contain DNA with close homology to 
the toxin gene of clone 1 above, confirming the 
similarity between the toxins produced by the three 
strains . 



8 
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CLAIMS 

1. An insect icidal composition adapted for oral 
5 administration to an insect comprising a pesticidal 
material obtainable from a Xenorhabdus species, or a 
pesticidal fragment thereof, or a pesticidal variant or 
derivative of either of these. 

10 2. A composition according to claim 1 wherein the said 
pesticidal material comprises material encoded by the 
nucleotide sequence of Figure 2 or variant or fragment 
thereof, or a sequence which hybridises with said 
sequence. 

15 

3. A composition according to claim 1 or claim 2 which 
comprises cells of Xenorhabdus. 

4. A composition as claimed in any one of the 

20 preceding claims which comprises supernatant taken from 
cultures of cells of Xenorhabdus species. 

5. A composition according to any one of the preceding 
claims wherein the Xenorhabdus species is Xenorhabdus 

25 nematophilus. 

6. A composition according to any one of claims 1 to 4 
wherein the Xenorhabdus species is ATCC 19061, NCIMB 
40886 or NCIMB 40887. 

30 

7. A composition as claimed in any one of the preceding 
claims which comprises a further pesticidal material not 
obtainable from Xenorhabdus . 

35 8. A composition according to claim 7 wherein the said 
further pesticidal material comprises a material 
obtainable from fl. thuringiensis. 
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9. A composition according to claim 8 which further 
comprises cells of B, thuringiensis. 

10. A composition according to claim 8 wherein the 

5 pesticidal materials obtainable from B. thuringiensis 
comprises the delta endotoxin. 

11. A composition according to any one of the preceding 
claims which further comprises an agriculturally 

10 acceptable carrier. 

12. A composition according to claim 10 wherein the 
carrier comprises items of insect diet. 

15 13. A method for killing or controlling insect pests, 
which method comprises administering to a pest or the 
environment thereof a composition according to any one of 
the preceding claims. 

20 14. A method as claimed in claim 12 wherein the pests 
are insects from the order Lepidoptera or Diptera. 

15. A microorganism comprising Xenorhabdus strain NCIMB 
40886. 

25 

16. A microorganism comprising Xenorhabdus strain NCIMB 
40887. 

17. A pesticidal agent which comprises a a toxin 
30 comprising a protein which is encoded by DNA which 

includes SEQ ID No. 1 or a variant or fragment thereof. 

18. An isolated pesticidal agent characterised in that 
it is obtainable from cultures of X. nematophilus or 

35 mutants thereof/ has oral pesticidal activity against 
Pieris brassicae, Pleris rapae and Plutella xylostella, 
is substantially heat stable to 55°C f is proteinaceous r 
acts synergistically with B. thuringiensis cells as an 
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oral pesticide, and is substantially resistant to 
proteolysis by trypsin and proteinase K. 

19. An isolated pesticidal agent as claimed in claim 18 

5 further characterised in that the pesticidal activity is 
substantially destroyed by treatment with sodium dodecyl 
sulphate or acetone or heating to 80°C. 

20. An isolated pesticidal agent as claimed in claim 18 

10 or claim 19 further characterised in that the agent is an 
extracellular protein. 

21. A recombinant DNA which encodes a pesticidal agent 
according to any one of claims 17 to 20. 

22. A recombinant DNA of claim 21 which comprises the 
sequence of Figure 2 or a variant or fragment thereof. 

23. A recombinant DNA which comprises or hybridises 
20 under stringent conditions with all or part of the 

sequence of Figure 2, and which encodes a pesticidal 
material. 

24. An expression vector comprising a recombinant DNA 
25 according to any one of claims 21 to 23. 

25. A host organism which has been transformed with an 
expression vector according to claim 24. 

30 26. A host organism as claimed in claim 25 which has been 
engineered or selected such that it also expresses other 
pesticidal proteinaceous toxicity enhancing materials 

27. A host organism comprising a nucleotide sequence 
35 coding for a fusion protein comprising a pesticidally 
active portion of an agent as claimed in any one of 
claims 17 to 20 in combination with other pesticidal 
proteinaceous toxicity enhancing materials. 
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28. A host organism as claimed in claim 27 wherein the 
pesticidal toxicity enhancing materials comprise delta- 
endotoxin from B. thuringiensis . 

5 

29. A host organism as claimed in any one of claims 25 to 
289 wherein the host is a plant. 

30. A host organism as claimed in any one of claims 25 to 
10 28 wherein the host is a virus pathogenic to insects. 

31. A fusion protein as expressed by a host as claimed in 
claim 27 . 

15 32. An pesticidal composition comprising one or more 
agents as claimed in any one of claims 17 to 20. 



II 
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121 
181 
241 
301 
361 
421 
481 
541 
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721 
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B41 
901 
961 
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1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
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1801 
1861 
1921 
1981 
2041 
2101 
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2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
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Fig.2. 

TCCACAATTG 
CGAATTTGCC 
AGCGTTAACG 
TGTCAA TGAT 
ATTGTACTTG 
AACACTGACT 
TCGAAAAGTG 
AGAATGGCTA 
GCTTTAAGTC 
TAGATGTATA 
AATGGGTGAA 
AGCAAGGCTT 
AGGGGGGCTT 
ATTTTTTGGT 
AGTTAATCAC 
TATATCCTGT 
CATCATTTAC 
GCCTGAATCA 
CCTGAATCAT 
ACCCATTGAT 
GCCAGATGGG 
ATATTTCCTA 
GTGATGCTCG 
GCATACGCAA 
AAGCCACTTA 
CGTTATCGTT 
CTGCCGGATG 
CTTACGGATG 
CTATACCGTT 
AACCATTTAT 
GAAACTACAG 
TTCTGTGATT 
GGCACTGCAA 
CTGTTCGGAG 
TGTCAGCGAC 
TGCCAAAGAA 
ATAAATACAG 
TCAGGGCGCG 
TTTTAATAGT 
AGAAAACTAA 
TGCTCGCGCC 
ACCAGATATT 
TCAACTGGAA 
ATATGGTGAA 
AAATATGATG 
GAATGGTTTG 
AATATAAATG 
TTGATTTCAG 
TGGATAGTTT 
GATCTGAATC 
AAAAAAAACA 
CCAATGAAAT 
TAATCCAACG 
ACCATATCAG 
AAAAATTCAT 
GGGTGGAAAA 
GAAGACAGAA 
GAAAGAGTTC 
GTGA7TTAAT 



CCGGAGAAAA 

GACCAGAATA 

GTAAAGAGTC 

ATGACCGGAA 

TATCATGCCT 

AAGTTTGGTG 

CGAACCAAGA 

CCGTCAGTGC 

TGCACTCCAT 

GTTATTTTTT 

CAGGTGATAC 

TCAGGGAATT 

ATGGCAGGTA 

ACTACCTCAA 

AATTTCATTG 

TTCATTACGC 

TTGTCGTTAA 

ATTGGAAATC 

TCGATGAGCC 

AAATCCTCTC 

ACAACTGGAG 

TAATTATCCG 

ATTATAAGTC 

TTAGCCGGAA 

AGAAGCCGCT 

GTGTGGATGA 

GTCAGATAAA 

ACAAAAATAA 

CAGACAAAAA 

CGTAAGGATG 

AGTAATTATG 

TATA TTATG C 

ACCCATTTTA 

AAAAAACGGG 

GGGGATAATA 

TATCCCATTG 

GACAGGTTAT 

AGCACACTAT 

G TTTTTA TCG 

AGAATTAAAG 

CTGAGTTTAT 

TATGAGCAAG 

ATTAAGCCGG 

AGAATATCTG 

ATGAGGTTGA 

AAAAATCAGG 

ACATAGTAAC 

CAGGAATGTT 

GGGAAGGAGT 

AATATGTAAA 

AATCACTAGA 

AACATGAAAA 

CCAAAAGTTT 

GCATCAATTA 

CCTACTGGCT 

AAAGAAATAA 

TTGATAAAAA 

ACTATAAATT 

TCGCCATTTT 



TCAGTCGGGA ACTGCCGGTG ATTATTCGTC ACTTATTAAA 
AGGCTAAAAA ACTGCTACAG GCGCAACGCG ACTCGAACGA 
ATTCGGATCC GCTGTATCGC TTTTGTGGTT ATCTGGTGTC 
TGAAGATGGG CAATAAAAAC ATTAGCCCAC GAGCACCGAG 
ATCTCTCTTT TATGGAAGCG CACGGCTTTG AACGTCCGTT 
AATCCATCCC CAAGATTATG CTGGAATACC GGAAGGAGTA 
AAGGCTATTC CTATAACGTG GAATTATCGG AAGAGGCCGA 
CTGAGTGTCG AGACTTTAAA TCACCTGTAT AAAACTTTGA 
ACACAACTTA AAATATCTAA TTGTATTTAA AAGAAAATAA 
AACTATACAT AAGCTCTACA TGCTCTTCAT TCGTGTAAAA 
AGTCAGTGAA TATCATATTA ATTACCGTAA ACCCAGATGT 
GTGCAGAGGG TGCATAACTG AGAGGGTGAA AAAGATTTTC 
AACAAAATCA GAAGCAAATA CCGTGCACAA TCTGGTTTTT 
ATTAAAATGA TGTAATCATC TGATTTTATT TAAGAATAGA 
ATGGACTTTC ATTCACACTG GTATAGATAA ATAATTCTGT 
ATTCATCAGG AGTGCTGTTA CAGGAGACAA GAATGTCACA 
AGGGCAAGAA GCAGGGTTTA ATTTCAGCGG GTTGTTCAAC 
GCTATCAAAA AGGACGTGAA GATCAAATAC AGGTATTGAG 
GTGACCAGAA TGTTAATCAT CAACCCGTCA GTTTTGTGAA 
CCCTGTTTGC TGGATGCCAG TTTTGTGCAT TACAGGACAA 
TTCTTTTATG AAATCAAGCT GACCAGTGCC ACGATTGTGG 
GCATTCAATC AATGATAATG GTGCGATACC CCATGAAGTG 
CATTTCATGC AACCACATCG CCGCAGGACT TCGGGCTACA 
GTGAAGAAGC AAGCCGCTTT TATCTGGGGT CTCGAATGTT 
GGTTGAAGAA ACCCCGGTAA AACCCGCTAA ACATCATGCC 
TGACGGCAAT CTTTTAACCG AACGCAAGTA TCGGGTTTGC 



AAGGTTACAC CCAATGGCAT 
AGGATTAATA CCATGCCAGC 
GAAAACCTGC TTTACGACTT 
TTGCTTGATG TTTTTCAGGA 
AAACACAACA GCA7ATCAGG CAGGAAATAG ACGACGATCT 



AGAAGGAAAG ACTGATAAAC 
ACTTGAATTT 
TAGAATCCAA 
CAAAAGGAAA 



CAT A . . . 
CGTACCTGT"! 
TTTC CATA.T ' 




AAATTATGC 
AGAAAATGTA 
AAAATGCCTG 
CCGTTGACTT 
GTCACCCACA CGAT 
CGAAAAGAAT 
TTTAGCTGCG 
AGTGAAATTT AATCGCA 
AACAAGATTG ACA' 
GTGGCCCTGC CG 



CATGGCTCAA ATATATTTCC 
GAATTAACTT CCGGTAAAGC 
GAA AGTACAGTTG AAACAAAACC 
AAA7A7CACT ATTCCTGAAC GACCTTTTAT 
,TTT GAAAAAAGTA AAATTGAATC 
GAT CAAAATGGAG CAAGTTTATG 
GAT GATTAT CTCT TAATGTTCAG 
.G GCAATTCTTT AGACTTTTAT 
TTCAAATATT AATCAAAGTA 
TATTGCCTGC CAATAGATAG 



"TAT 



CGGCACGAGA ATTATGGCAA TATGGCCGAA CTAAAATTGG 
GTGAGGGTTG CCGACATCCT AAAGGTACTT TTTATAATCA 
GGTTAGATTG GCTGACATTG GCAAGCCTAA GAGATTCAGA 
TGATGAAGTA GCTGGTATTA CAATGTGGGG AAAATTGACA 
GTATGAAAAA GTATTTAGTA ATGTCGGCTT ATCCCATTCT 
TCTTAGTGAT TACTATAACA AAGGATATCA TGTTGTTACT 
ATCAGATTTT GGTGACATAG AAACATCAGG AAAAAATCAT 
AGTAGAAAAC TATGAGAAAG AAAATATCAC AAATAATTCA 
TTTAAATCTG TTTTCATGGG GTAAAGTGGA AC ATCAA ATT 
TTATGTACTC AACCATATTT TTTGAGGGTT GGTTTTTAAA 
AAATATTAAT TATTTTTATT TTTTTACTTT ATGGTTGTGG 
TACCAAAATC AGAGTTTCTT CCTGATGCAG TGATAAATGA 
CCATCACAGG AGGTGCATTG AATGAAAAAA GCGTTTGGGT 
CAGGACTAAC ATGGAATCCA AAAGATAGTT CTTTCCTATA 
GAAAAGATTA TCATCATATA AATATAACAG GTACCCCAAA 
TTGAAGTGGT AGGATTTACA TTGGGTACAA TGTACGCACG 
ATACTATAAA AGTAAGGGAA TAATTGTCAC TATCAGAATG 
TATACTTTTG TATACTCTCT CAACATAATC AGGATTCTTT 
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CTTATTATTT 
AAATTATCTG 
ATGAACTCGA 
AATCACCCAC 
AGCAAGCCGT 
GTTCACATGT 
CCATGACGCC 
GCGTCGAAGC 
AAAAA ACAG G 
CGGAGTTTGC 
ATTACCGTCT 
TTACCAGACA 
GGCCAGACAA 
GGGATAGAGG 
CTTCCCGCTT 
GACTGGCGAC 
TTGAAAGGTT 
GTCAGTCGAG 
TCTGAAAATT 
TGTCAACATA 
AAATCACTTG 
TCAGTTATAA 
TTA ACGTAA A 
TGTGTTTTGT 
TACAATATGC 
TTCAACCGTA 
ATCCTATTTA 
TGTGCTCAAT 
GATG AAGAG G 
TAAATTTTAT 
ATAGCATCCT 
ATAATGACTT 
TTATAAAATA 
CACTCGCGAC 
GAGAAAAATC 
TATAGAGCAG 
ACAATTATCC 
TGAAATGTTT 
TTCACCGGCT 
CTCTGCTTAT 
TAATATGGAT 
ACCCGCAAGA 
GCCATTGATA 
GACAGTACAC 
TCATTACTGG 
ATTACGGAAA 
GAAAATTTCG 
CAAAAATACC 
AATATCTCAA 
CGTGTAAAAA 
AATAATCAAT 
AGGAAAAACT 
ACTAATTTCA 
AAAATATATG 
TTCACTTTTG 
TTGTGCCTGA 
GCACAAGGCA 
CACCGTTATG 
TATGCCCGAC 
AGGGAAAATC 
TACCTTTGCC 
GTTAGGCAAA 
TATATCTTCA 
ACTGTGTATG 



TTCAT GGTGC 
CATTACTGTT 
AGATAATCCT 
ACCGGGACAT 
CCCAGATCAC 
GGCACAGATA 
TGACATGATT 
CAGGCAGGGT 
GGCTCCCCTA 
TGAAAAATCC 
GGTCTATTTT 
ACGTCAGGCA 
GGGACAACCG 
AAAAAATCAG 
A CAGC CCAGA 
GTTTTATCAC 
ATGGCGACCA 
GTTTTTTCAT 
GCTGTTTCTG 
CTGTTAA GTT 
GACAATATTT 
CCGAATAAGG 
AGTTGATAAA 
TAATGAATGA 
TATTTATTTA 
ACTTAGCTTC 
TTTATGATAA 
CTTGGATTCA 
ATGCCAACAT 
GATTAAAATG 
CTTTAATATC 
ACAAAATAAG 
ATTGAGGTTA 
GGTCAGACGA 
TTTGATGACC 
AAAAAAAATA 
GGTGCTATCC 
GGTGCCCGTT 
GGCTATCTCA 
CATCTTGATA 
ACAGAAATTT 
CCGGAGGTGA 
CCCCTTACCA 
TGTCAGCGCT 
CGATTCTGGC 
AGAACGCTGA 
CGTCACAATC 
TCGGGATGTT 
CGGGTTTAGT 
C AGAT GATTA 
TCTTTATATG 
CAGGGACAAG 
AAAGCAATTA 
CCTATCGCTA 
AGTCTTATCC 
CTAGCGGGCT 
TCATCAACGA 
CACTGAGCTT 
GATGACAGTG 
TTTGAAGCCG 
CGTTCAGCCC 
CTGGCGGGTG 
CTGTATCGCC 
CTTTATGGTT 
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TAAAAACGTT 
ATAATCGATA 
TTCTGAGCCT 
TCGTACGCGA 
CGCTGAAATC 
GCGGGATTAT 
GCCACTGCGC 
TTCCCTGCCT 
TAAACGCCCC 
GCCTTGCTGA 
GAGTTCTGGG 
GTATCACGCG 
CCTGACATTT 
AAATGGCGGG 
GCTGTATCTG 
CTGGACTCAG 
ATTTGCAATT 
TTCGGGTCGT 
TGGCTACGTC 
AGATGTTGAT 
TATTTCACAT 
ATCTTGAAAA 
GAAAATTATT 
ATAACCAGGT 
TATAAAGAGT 
ATCGACTTTT 
ATAAAATTTA 
AGTATGTATT 
GACACAATAT 
AAATTTTAGT 
ATTAATCTCA 
CACTAAATCT 
TTATGTATAG 
TGACTCTTGC 
AGCTCAGTTG 
ATCGCTTGCT 
GACTCGGTAT 
CTTCTTCCTT 
CCGAATTGTA 
ATCGCCGTCC 
CCACCCTGAC 
TTCGGACGCA 
TCAGCCTTAC 
GTCCCGTAAT 
CAATATTTCT 
TGCTTTATTT 
ATGGATAGCC 
GCAGAATGGC 
GGTCAATAAT 
TGATAAACAT 
TGCTAATTTT 
TGGCATTGTC 
CTTAAGTAAC 
TACGTCTTCC 
CCTGACTATA 
TTCACCGAAT 
CTCCGTTCTG 
TGATGATGCA 
TCAGTCATTT 
ACGGCAACAC 
TGATGCGTGG 
TATTGGACAC 
TCACGTTACT 
TTTCGCCGTT 



TATTGCAAAA 
ACACGATAAC 
GAACGAATCA 
GGAACGGGTT 
GG ATGCAGTC 
TCGGCGGTCA 
TCGAAGCCGC 
TGTACGCTTG 
CGCCTGTCGC 
ATAAAATTAA 
GGCGTTAAAT 
AGATGACGTG 
TTAGTGTTGG 
TGACGAGAAC 
ATTGAAATCG 
GA TACAAT GG 
AACTTTTCTT 
GGGGATGATA 
TGTCTTTTGG 
AAAAGAGACT 
GAGACATTAA 
ATCATGGGAT 
TAATTCTAAG 
AAGCTGGATT 
TTGTGCCCAT 
GGCCTCGCCT 
ATTATCTTTA 
CCTTTTGGTA 
CGATTACGAC 
AGAAAATCGT 
GATAAAACAA 
TCAGATGAAC 
CACGGCTGTA 
GGATCTGCAA 
GGGAGAGGCT 
GGAAGCGCGT 
TGAACGAGAC 
TGTGAAACCG 
TCGTGAAGCG 
GGATCTGGCT 
ACTGTCTAAC 
TTGATGGAGA 
GAGACTATCC 
CCTGAGGTGA 
CCAGAACTGT 
GCGCAAAACT 
AAGTATTATG 
TATTCTGACA 
GAAAGTAAAC 
GTAAATTACT 
AAGATATCGA 
GGCAGCCTTT 
ATATCTGATA 
ACCAGCGCCA 
TTTGCGCTCA 
GAACTGCAAA 
ACCAAAGTTT 
CAGGTACTGA 
TAACCGTCTC 
GGTCAGCATT 
TCTGGGGATC 
ACAAAATATC 
GGCCCGTGCC 
CAATGGCAAA 



ATAAATTAAG 
CTGACTTTCT 
CATTGCAACC 
TACTCATGCT 
TCCGGGTTAT 
TGCCGGAGGC 
CAGCGCAGAG 
AAACGCTGGC 
TTAAAAAAAG 
GGCCGGAGCA 
TACAC GGATA 
ATTGATTTTT 
ATAATGCGCG 
ACAACCTGTT 
TCTGGAAACA 
AATATGAGGT 
GAGTACT TAG 
CTGAAAATTT 
GATATTGTTT 
GAATT ATAAT 
GGTTGATTTT 
CTTACTTTTA 
TGCC GTTGGC 
TTCATTTTTT 
TTAACCAGTA 
GGTCAGAATC 
ATAAG CTGA A 
CCCTGCTTTA 
TGTAACATTA 
ATTCTATTCC 
ATAATTACAA 
TCTTAACTGA 
TTACTCAATA 
TATTTATCCT 
CGCCATCTCT 
ATTTTTACCC 
AGCGTTTCAC 
GGTTCAGTGG 
AAGGACTTAC 
GATCTGACTC 
GAACTGTTGC 
GCCTGTCAAC 
GTCAGGTCAT 
TGGGGCAGGC 
ATAACATTTT 
TC AGTG AAAA 
GTCTTGAACT 
GCACCTCTGC 
TCGAAGCTTA 
TTGATCTGAT 
GAGAATTTGG 
CCGGTCCCCT 
ATGAATACAG 
CAAATCAGGG 
AACTGAATAA 
CTATCGTACG 
TCTATACTCT 
ACGGATCGGT 
TTTAATACCC 
GATCCGGATG 
AACAGTGGTG 
CTCACACTTT 
CATCAGCTGA 
ACAACGGCTT 



TTAATCAGAT 
GCCTGTTCTT 
ACTCGCTTTG 
TGCCAGAGGG 
CTGTAATTGG 
CGGTATCTCG 
TCCCTGACGT 
GAATACCCTG 
CGCAATAAAA 
CAGTCAGGAC 
ACACGCTGTT 
TAGAGCCGGT 
TATCCATCAC 
TTTATTCTAT 
GGCCAAATAC 
AAATACTTTA 
TAAGAATAGA 
GTTTGTAATC 
CCATCAAGTC 
ACAAAACAAT 
CCCAATCTGG 
TCAAATGAAG 
ATAAATATTT 
AATTACTCGT 
AACAAATTTG 
TAGGGCCGTT 
TATGTGGATT 
TTTTAAGGCA 
AAGTCAGTTA 
GCCATTTACA 
TGTGAATAGA 
CAACACTATT 
AAATCAGTCC 
TCAGTGAACT 
ATCATGAAAC 
GTGCCAACCC 
GCAGTTATGA 
CTTCCATGTT 
ATTTTTCAAG 
TGAGCCAGAG 
TGGAGCTATT 
TTACCGTCAG 
TATGACCCAT 
GGAAGGGGCT 
GACCGAAGAG 
TATCACGCCC 
TTCTGAGGTG 
TTATGTGGAT 
CAAAATAACA 
GTATGAAGGA 
GGCGACTCTT 
GGTAGCCAAT 
AAATGGCGTA 
CGGCGGAATA 
AGCCATTCGC 
CAGTGACAAT 
GTTCTACAGT 
CATTAATCAA 
CGCCGCTGAA 
AAGAACAATC 
AACTGTATCA 
CTGTCCCTGT 
CGGTTAATGA 
CTTTGTCTTC 
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7381 CGGGGAGTTG TCACGGCTGG TTATCTGGTT GTATCAGGTG ACGCAGTGGC TGACTGAGGG 

7441 CGGAAATCAC CACTGAAGCG ATCTGGTTAT TATGTACGCC AGAGTTCAGC GGGAATATTT 

7501 CACCGGAAAT CAGTAATCTG CTTAATACTC TCCGACCCCG TATTAGTGAA GACATGGCAC 

7561 AAAGTAGTGA CCGGGAGCTT CAGGCTGAAA TTCTCGCGCC GTTTATTGCT GCAACGCTGC 

7621 ATCTGGCGTC ACCAGATATG GCGCGGTATA TCCTGTTGTG GACTGATAAC CTGCGGCCGG 

7681 GCGGCCTGAA TATCGCCGGA TTTATGATGC TGGTGCTGAA AGAGACGCTG AGTGATGAGG 

7741 AAACGACCCA ACTGGTTCAA TTCTGCCATG TAATGGCACA GTTATCGCTT TCCGTGCAGA 

7B01 CACTGCGTCT CAGTGAAGCA GAGCTTTCTG TGCTGGTCAT TTCCGATTTT GTGGTACTGG 

7861 GTGCGAGAAG CCAACCGCCG GACAACACAA TATTGATACT CTGTTCTCAC TCTACCGATT 

7921 CCACCAGTGG ATTAATGGGC TGGGAAATCC CGGCTCTGAC ACGCTGGATA TGCTGCGCCA 

7981 AGCAGACACT CACGGGCGAC AGACTGGGCC TCCGTGATGG GGCTGGACAT CAGTATGGTA 

8041 ACGCAGGCCA TGGGTTCCCG CCGGCGTGAA CCAACTTCAG TGTTGGCAGG ATATCAACCC 

8101 CGTGTTGCAG TGGATACATG TGGCATCAGC ACTGCTCACT GATGCCGTCG GTTATCCGTA 

8161 CGCTGGTGAA TATCCGTTAC GTGACTGCAT TAAACAAAGC CGAGTCGAAT CTGCCTGCCT 

8221 GGGATAAGTG GCAGACGCTG GCAGAAAATA TGGCAGCCGG ACTG AGTACA CAACAGGCTC 

8281 AGACGCTGGC GGATTATACC GCAGAGCGCC TGAGTAACGT GTTGTGCAAT TGGTTTCTGG 

8341 CGAATATCCA GCCAGAAGGG GTGTCCCTGC ACAGCCGGGA TGACCTGTAC AGCTATTTCC 

8401 TGATTGATAA TCAGGTCTCT TCTGCCATAA AAACCACCCG ACTGGCAGAG GCCATTGCCG 

8461 GTATTCAGCT CTACATCAAC CGGGCGCTGA ACCGGATAGA GCCTAATGCC CGTGCCGATG 

8521 TG7CAACCCG CCAGTTTTTT ACCGACTGGA CGGTGAATAA CCGTTACAGC ACCTGGGGCG 

8581 GGGTGTCGCG GCTGGTTTAT TATCCGGAAA ATTACATTGA CCCGACCCAG CGTATCGGGC 

8641 AGACCCGGAT GATGGATGAA CTGCTGGAAG ATATCAGCCA GAGTCAGCTC AGCCGGGACA 

8701 CGGTGGAAGA GGCCTTTAAA ACTTACCTGA CCGCTTTGAA ACCGTGGCAG ACCTGAAAGT 

8761 TGTCAGCGCT ATCACCGACA ACGTCAACAG CAACACCGGA CTGACCTGGT TTGTCGGCCA 

8821 AACGCGGGAG AACCTGCCGG AATATTACTG GCGTAACGTG CATATATCAC GGATGCAGGC 

8881 GGGTGAACTG GCCGCCGATG CCTGGAAAGA TTGGACGAAG ATTGATACAG CGGTCAACCC 

8941 ATACAAGGAT GCAATACGTC CGGTCATATT CAGGGAACGT TTGCACCTTA TCGTGGGTAG 

9001 AAAAAGAGGA AGTGGCGAAA AATGGTACTG ATCCGGTGGA AACCTATGAC CGTTTTACTC 

9061 TGAAACTGGC GTTTCTGCGT CATGATGGCA GTTGGAGTGC CCCCTGGTCT TACGATATCA 

9121 CAACGCAGGT GGAGGCGGTC ACTGACAAAA AACCTGACAC TGAACGGCTG GCGCTGGCCG 

9181 CATCAGGCTT TCAGGGCGAG GATACTCTGC TGGTGTTTGT GTACAAAACC GGGGTGAGTT 

9241 ACCCGGATTT TGGCGACAAC AATAAAAATG TGGCAGGCAT GACCATTTAC GGCGATGGCT 

9301 CCTTCAAAAA GATGGAGAAC ACAGCACTCA GCGTTACAGC CAACTGAAAA ATACCTTTGA 

9361 TATCATTCAT ACTCAAGGCA ACGACTTGGT AAGAAAGGCC AGCTATCGTT TCGCGCAGGA 

9421 TTTTGAAGTG CCTGCCTCGT TGAATATGGG TTCTGCCATC GGTGATGATA GTCTGACGGT 

9481 GATGGAAAAC GGGAATATTC CGCAGATAAC CAGTAAATAC TCCAGCGATA ACCTTGCTAT 

9541 TACGCTACAT AACGCCGCTT TCACTGTCAG ATATGATGGC AGTGGCAATG TCATCAGAAA 

9601 CAAACAAATC AGCGCCATGA AACTGACGGG GTTGGATGAA AGTCCCAGTA CGGCAATGCA 

9661 TTTATCATCG CAAATACCGT TAAACATTAT GGCGGTTACT CTGATCTGGG GGGCCCGATC 

9721 ACCGTTTTTA TTAAAACGGA AAAACTATAT TGCATCAGTT CAAGGCCACT TGATGAACGC 

97 81 AGATTACACT AGGCGTTTGA TTCTAACACC AGTTGAAAAT AATTATTATG CCAGATTGTT 

9841 CGAGTCTCCA TTTTCTCCAA ACACAATTTT AAACACCGTT TTCACGGTTG GTAGCAATAA 

9901 AACCAGTGAT TTTAAAAAGT GCAGTTATGC TGTTGATGGT AATAATTCTC AGGGCTTCCA 

9961 GATATTTAGT TCCTATCAAT CATCCGGCTG GCTGGATATT GA CACA GGTA TTAACAATAC 

10021 TGATGTCAAA ATTACGGTGG TAGCTGGCAG TAAAACCCAC ACCTT TACGG CCAGTGACCA 

10081 TATTGCTTCC TTGCCGGCAA ACAGTTTTGA TGCTATGCCG TACACCTTTA AGCCACTGGA 

10141 AATCGATGCT TCATCGTTGG CCTTTACCAA TAATATTGCT CCTCTGGATA TCGTTTTTGA 

10201 GACCAAAGCC AAAGACGGGC GAGTGCTGGG TAAGATCAAG CAAACATTAT CGGTGAAACG 

10261 GGTAAATTAT AATCCGGAAG ATATTCTGTT TCTGCGTGAA ACTCATTCGG GTGCCCAATA 

10321 TATGCAGCTC GGGGTGTATC GTATTCGTCT TAATACCCTG CTGGCTTCTC AACTGGTATC 

10381 CAGAGCAAAC ACGGGCATTG ATACTATCCT GACAATGGAA ACCCAGCGGT TACCGGAACC 

10441 TC r GTTGGGA GAAGGCTTCT TTGCCAACTT TGTTCTGCCT AAATATGACC CTGCTGAACA 

10501 TGGCGATGAG CGGTGGTTTA AAATCCATAT CGGGAATGTT GGCGGTAACA CGGGAAGGCA 

10561 GCCTTATTAC AGCGGAATGT TATCCGATAC GTCGGAAACC AGTATGACAC TGTTTGTCCC 

10621 TTATGCCGAA GGGTATTACA TGCATGAAGG TGTCAGATTG GGGGTTGGAT ACCAGAAAAT 

10681 TA r CTATGAC AACACTTGGG AATCTGCTTT CTTTTATTTT GATGAGACAA AACAGCAATT 

10741 TG~ATTAATT AACGATGCTG ATCATGATTC AGGAATGACG CAACAGGGGA TCGTGAAAAA 

1O801 TATCAAGAAA TACAAAGGAT TTTTGAATGT TTCTATCGCA ACGGGCTATT CCGCCCCGAT 

10861 GGATTTCAAT AGTGCCAGCG CCCTCTATTA CTGGGAATGT TCTATTACAC CCCGATGATG 

10921 TGCTTCCAGC GTTTGCTACA GGAAAAACAA TTCGACGAAG CCACACAATG GATAAACTAC 

10981 GTCTATAATC CCGCCGGCTA TATCGTTAAC GGAGAAATCG CCCCCTGGAT CTGGAACTGC 

11041 CGGCCGCTGG AAGAGACACT CCTGGAATGC CAATCCGTTG GATGCCATTG ATCCGGATGC 

11101 CGTCGCACAA TATGACCCGA CACACTATAA AGTTGCCACC TTTATGCGCC TGTTGGATCA 

11161 ACTTATTCTG CGCGGCGATA TGGCCTATCG CGAACTGACC CGCGATGCGT TGAATGAAGC 
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CAAGATGTGG 
CCAACAGTGG 
TCAACAAGAC 
CTCGTTGGTG 
TGCGTTTGCG 
GGCGAATTAC 
AGGGCGGTAG 
AGCGGGCCCG 
CAGAGCATGA 
CGACACAGAG 
TATTGGCAGA 
ACGAGGATAT 
AGTCTCTGGC 
TGTTCGGTTT 
TGATGTCGCT 
CCTACCGCCG 
AACAAATGGA 
TGGAATATCA 
AATTCACAAA 
AGTTCTTTGA 
TGACCGACAA 
TGATGGCGGG 
GTGATGAGCG 
TATCATCAGA 
GCAACGTAGG 
TGCGATTGTC 
AGTTGAAACA 
CGGTGCTGAA 
CCCACGGCGT 
CGTTTGAAGG 
CTGATCGACA 
CCATTCGTTC 
GTTTTTATGC 
GGATCACTAA 
TTTCACTGCC 
ACAGCAGTAC 
TCAGCCTGCG 
CGGATGGGGA 
CAACCTCACT 
GCGTGGCAGA 
AGACGTCTTT 
ATCATGCACG 
AAACCGTCAC 
ACTGTGATGA 
AGTCCACTAT 
TGTTGATAAT 
GAACTCCGTA 
AAAATGGCGT 
CCGTCGCTTG 
GGTTGCAGAA 
CAAGGTTTCC 
GATGATGTCC 
GCAGTCCATG 
ATATGGAGAA 
GTGCTCCGGT 
AACCACTGCC 
ACGGGCGTCT 
CGGAAGGTGA 
CGCAGGCAAA 
CAAATAGTGT 
TTCATTTGTC 
TCAGTGATAT 
GCTACTGGCC 
AAATTACGGG 



TATGTGCGTG 
GCCGCACCGT 
CTTA CGGCGC 
GTTTGGTCCT 
CCTGGTTAAC 
GCGAGCCTAC 
TGCAGTGCTG 
CAATCTGGTA 
TGATGCCGAT 
CATCCGTATT 
GAGCCGCCGC 
CAACCACGGA 
CGGGCAGGCG 
CGCTTGTGGC 
TTCTGCCACA 
CCGCCGTCAG 
TGCCCAGCTG 
GGAGACCCAG 
CAAAGCGCTT 
CCTGACCCAG 
CGGTGTTACC 
TGAAACGTTG 
GG C ACTGG AA 
CAACTTTAAT 
AGCTTCCGGC 
TGATTTGAAA 
AGTGAGTGTC 
TTACGGCGGC 
GAATGACAGT 
TATTTCCGTG 
GAAAGCGCTG 
TTAATTAAAA 
AGGGTTCAAC 
AAGGAATGGG 
CTTGCCGATC 
TGCTGGCAAT 
TACCGCCAAG 
AGTGTTGAGT 
GTTGGGGACG 
AAAAATCGTT 
TTGGGTACTT 
TATTGCTGAC 
GCATACCGGG 
GCATGAACTT 
GGCAATACTC 
GACTGGTTGT 
CCCGAATTCA 
TGTCGTCCGG 
TGTCGCCAAG 
GAAACACCGG 
TTGCTGCAAA 
CCGCTGGAAA 
CCGCAGTTAG 
GGAATTTCCG 
ACGGGATATC 
ACATATTCCG 
GGATTGGGTG 
ATGGACACCC 
ACTGGCTGAT 
ACGTGTCTGG 
AAATAAGCCA 
GACAGGCTCC 
GAACCTGGGG 
GAAACGTTTA 
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CTTTGGAATT 
CTCTTTCCGT 
TAGACAACGG 
GCCGGAATAT 
CTGCGCCATA 
GATCCGAAAG 
CCCGGCACAT 
GCGCAATTAA 
GAACTCACCA 
CAGCAACGAA 
AGTGCACAAA 
GAACAGCGTG 
CTCTCAGTAG 
GGCAGTCGTT 
GCTTCCCAAT 
GAGTGGGAAA 
GAAAGCCTGA 
CAGGCCCATA 
TACAGTTGGA 
TCCTTCTGCC 
TTTATCCGGG 
CTGCTGAATC 
GTGACCCGTA 
CTGACCGAAA 
AATGAATTAA 
ATTTTCAG CG 
ACCTTGCCGG 
AGCATCGTCA 
GGTCAATTTA 
AATGACAGCG 
CTGGAGAGCC 
CATTGTGATA 
ACCTTTGAAA 
AGAAGCACTC 
TCTGTCCGGC 
GGGTCATTCG 
GGCGTTCCGC 
ATTGTGCCGG 
GTTCTGACAC 
CGTTTAGAAC 
TTTACTGCGG 
CCGCAGGATG 
GAACATATTT 
GCTCAGCATT 
AGCCGGAAAC 
TTCATCTGGT 
ATGTGTCAGA 
ACAGTTTCTC 
TTCTGATGTT 
CGCTGGTTTC 
CGGCCCGCAG 
TGGATTATCA 
AAAAAATGAA 
GCGTTACTTT 
ACTGCCGAAG 
GCACAACAGG 
ATTACGGCAT 
TTTATTCCAT 
ATTGATGGGG 
TCAAATAATC 
CTGCCGGTTC 
GGGCAATCAC 
CATGGAAAAT 
ACCCCCACAG 



GCTGGGTGAT 
GGCGGGCAAC 
AGAAGGTTGC 
AACCCGGAAT 
ATCCTTCCAT 
CGCTGCTCAC 
TGTCGTTATA 
CCCAGTTCGG 
CGTTGCTACT 
CTGTCGATGA 
ATCGTCTGGA 
CGATGTCACT 
CAGAAGGGGT 
GGGGGGCAGC 
ATTCCGCAGA 
TTCAGCGTGA 
AAATACGCGG 
CTCAGGCTCA 
TGCGCGGCAA 
TGATGGCACA 
GTGGGGCCTG 
TGGCAGAAAT 
CCGTCTCGTT 
AACTCACGCA 
AACTCAGTAA 
ATACCCCGGA 
CGCTGGTTGG 
TGCCACGCGG 
TGCTGGATTT 
GTAGCCTGAC 
TGAGCGATAT 
GGCAGGCTCC 
CTTGAAATAC 
AATGCCGTCG 
GTGGTCTGGT 
GGATGGGGTG 
ACTATACGGG 
ACAGCCAAGG 
AGCCGCCTAC 
ACTGGCAGCC 
ATGGTTTAGT 
AAACCAGAAT 
ACTATCACTA 
CA GGTGTTA C 
CGCTTTTTTC 
ATTTGATTAC 
AAACAATGTG 
CCGCTATGAA 
TCATCAGCTG 
CCGTCTTATT 
ACTGGCCCAT 
ACGTGTTAAT 
CACGTTGCAG 
ATCAGGATAC 
GAACGAATGC 
AAAGCGCGAT 
CAGGGTTACG 
TATCCGCTGT 
CTGGGCTGCC 
CGGCAGGATG 
CCGGCAAAAA 
ATCTGGTGGA 
TTGGTGAGCC 
ACTGTATATG 



GAGCCGGAGG 

CACACTGTGC 

ACTCAACCCC 

CAACCGATTA 

GACGGGCAAC 

CAGTATGGTA 

CCGCTTCCCG 

CACCTCTCTG 

ACAGCAGGGT 

AGTGGATGCT 

AAAATACCAG 

GTTTGATGCG 

GGCTGACTTA 

ACTGCGTGCT 

CAAAATCAGC 

TAATGCTGAC 

CGAAGCAGCA 

GTTAGAGCTG 

GCTGAGTGCT 

GGAAGCGCTG 

GAACGGTACG 

GGAAAAAGTC 

GGCACAGTTC 

ATTCCTGCGT 

CCGCCAGATA 

AAGCTTTGGC 

TCCGTATGAA 

TTGCAGTGCT 

CAACGATTCC 

GTTGAGTTTC 

CATTCTGCAT 

TGAGGGAGCC 

CGTCATTGCC 

GAGCGGAAGG 

GCCGGTGCTA 

GCAATGTGGG 

ACAAGATGAG 

GCAACCAGAG 

TGTTACCCGC 

ACAGCAGAGA 

GCACCTATTC 

TGCCCGCTGG 

TCGGGCAGAA 

GGCCCACCGT 

GCGGTAAAAT 

GGTGAGCGCT 

TCTGAAAACA 

TATGGGTTTG 

AAAGCGCTGG 

CTGGATTATG 

GAAACGGACG 

CATGGCGTGA 

CCATACCAAT 

TCAGAAAGCC 

GGTTACCTAT 

GTTGTTGGAC 

GGGCTACCAC 

GCCAATGGAA 

TGACTTAGCG 

GGATCGCGCT 

TAAGCGTCAT 

AGTTACGGCA 

TCTGATGATA 

GTAGACCTAA 



ATTACGGCAG 

AAGCGGGCTA 

GCAACGCTAA 

CTGGCAAACC 

CGTTATCGCT 

CAGCCTTCTC 

GTGATGCTGG 

CTCAGTATGG 

ATGGAACTGG 

GATATTGCTG 

CAGCTGTATG 

GCGGCAGGTC 

GTTCCAAACG 

TCCGCCTCCG 

CGTTCGGAAG 

GGTGAAGTCA 

CAGATGCAGG 

TTACAGCGTA 

ATCTATTACC 

CGCCGCGAGC 

ACTGCGGGTT 

TGGCTGGAGC 

TATCAGGCCT 

GAAGGGAAAG 

GAAGCCTCAG 

AATACCCGTC 

GATATCCGGG 

ATTGCTCTCT 

CGTTATCTGC 

CCGGATGCGA 

ATCCGCTATA 

TGTTTAAGGA 

CTCTGGGGGC 

GGAGCGTCAT 

TCACTGAATT 

GTTGGTTTTA 

TATCTCGGGC 

CAACGCACCG 

TATCAGTCCC 

CGTGAGGAAG 

GGTAAGCATC 

CTGATGGAGG 

GACGATCTTG 

TATCCTGGCA 

CAGGTATCCC 

TATCTTCGCT 

ATGTGTCTGA 

AAATTCGAAC 

CAGGGGAAAA 

ACCTGAACAA 

GTACGCCAGT 

ATCTGAACTG 

TGGTTGATTT 

TGGTGGTACC 

GAGGAGGCGA 

ATCAATGGTG 

ACCATGTCAC 

TATTTCCATC 

CTTATCGGGC 

CAGGATGTTA 

CTTGTCGCAT 

AATAGCGTGC 

ACAGGCTTCC 

ATGGCTCAGG 
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Fig.2. 

15061 CACCACCCGA TTTTATTTAT GCCCGCAATA CTTACCTTGA ACTCTATGCC AATGAAA6CG 

15121 GCAATCATTC TGCTGAACCT CAGCGTATTG ATCTGCCGGA TGGGGTACGT TTTGATGATA 

151 81 CTTGTCGGTT ACAAATAGCG GATACACAAG GATTAGGGAC TGCCAGCATT ATTTTGACGA 

15241 TCCCCCATAT GAAGGTGCAG CACTGGCGAT TGGATATGAC CATATTCAAG CCTTGGCTGC 

15301 TGAATGCCGT CAATAACAAT ATGGGAACAG AAACCACGCT GTATTATCGC AGCTCTGCCC 

15361 AGTTCTGGCT GGATGAGAAA TTACAGGCTT CTGAATCCGG GATGACGGTG GTCAGCTACT 

15421 TACCGTTCCC GGTGCATGTG TTGTGGCGCA CGGAAGTGCT GGATGAAATT TCCGGTAACC 

154 81 GATTGACCAG CCATTATCAT TACTCACATG GTGCCTGGGA TGGTCTGGAA CGGGAGTTTC 

15541 GTGGTTTTGG GCGGGTGACG CAAACTGATA TTGATTCACG GGCGAGTGCG ACACAGGGGA 

15601 CACATGCTGA ACCACCGGCA CCTTCGCGCA CGGTTAATTG GTACGGCACT GGCGTACGGG 

15661 AAGTCGATAT TCTTCTGCCC ACGGAATATT GGCAGGGGGA TCAACAGGCA TTTCCCCATT 

15721 TTACCCCACG CTTTACCCGT TATGACGAAA AATCCGGTGG TGATATGACG GTCACGCCGA 

15781 GCGAACAGGA AGAATACTGG TTACATCGAG CCTTAAAAGG ACAACGTTTA CGCAGTGAGC 

15841 TGTATGGGGA 7GATGATTCT ATACTGGCCG GTACGCCTTA TTCAGTGGAT GAATCCCGCA 

15901 CCCAAGTACG TTTGTTACCG GTGATGGTAT CGGACGTGCC TGCGGTACTG GTTTCGGTGG 

15961 CCGAATCCCG CCAATACCGA TATGAAGGGG TTGTTACCGA TTCCACAGTG CAGCCAAAAG 

16021 ATTGTCCTTA AATATGATGC GTTAGGATTT CCGCAGGACA ATCTTGAGAT TGCCTATTCG 

16081 AGACGTCCAC AGCCTGAGTT CTCGCCTTAT CCGGATACCC TGCCCGAAAC ACTTTTCACC 

16141 AGCAGTTTCG ACGAACAGCA GATGTTCCTT CGTCTGACAC GCCAGCGTTT TTCTTATCAC 

16201 CATCTGAATC ATGATGATAA TACGTGGATC ACAGGGCTTA TGGATACCTC ACGCAGTGAC 

16261 GCACGTATTT ATCAAGCCGA TAAAGTGCCG GACGGTGGAT TTTCCCTTGA ATGGTTTTCT 

16 321 GCCACAGGTG CAGGAGCATT GTTGTTGCCT GATGCCGCAG CCGATTATCT GGGACATCAG 
16381 CG7GTAGCAT ATACCGGTCC AGAAGAGCAA CCCGCTATTC CTCCGCTGGT GGCATACAT7 
164 41 GAAACCGCAG AGTTTGATGA ACGATCGTTG GCGGCTTTTG AGGAGGTGAT GGATGAGCAG 
16501 GAGCTGACAA AACAGCTGAA TGATGCGGGC TGGAATACGG CAAAAGTGCC GTTCAGTGAA 
16561 AAGACAGATT TCCATGTCTG GGTGGGACAA AAGGAATTTA CAGAATATGC CGGTGCAGAC 
16621 GGATTCTATC GGCCATTGGT GCAACGGGAA ACCAAGCTTA CAGGTCAAAC GACAGTGACG 
16681 TGGGATAGCC ATTACTGTGT TATCACCGCA ACAGAGGATG CGGCTGGCCT GCGTATGCAA 
16741 GCGCATTACG ATTATCGATT TATGGTTGCG GATAACACCA CAGATATCAA TGATAACTAT 
16801 CACACCGTGA CGTTTGATGC ACTGGGGACG GTAACCAGCT TCCGTTTCTG GGGGACTGAA 
16861 AACGGTGAAA AACAAGGATA TACCCCTGCG GAAAATGAAA CTGTCCCCTT TATTGTCCCC 
16921 ACAACGGTGG ATGATGCTCT GGCATTGAAA CCCGGCATAC CTGTTGCAGG GCTGATGGTT 
169B1 TATGCCCCTC TGAGCTGGAT GGTTCAGGCC AGC7777C7A ATGATGGGGA GCTTTATGGA 
17041 GAGCTGAAAC CGGCTGGGAT CATCACTGAA GA7GG77A7C TCCTGTCGCT TGCTTTTCGC 
17101 CGCTGGCATC AAAATAACCC TGCCGCTGCC A7GCCAAAGC AAGTCAATTC ACAGAACCCA 
17161 CCCCATGTAC TGAGTGTGAT CACCGACCGC 7A7GA7GCCG ATCCGGAACA ACAATTACGT 
17221 CAAACGTTTA CGTTTAGTGA TGGTTTTGGG CGAAACCT7A CAAACAGCCG TACGCCATGA 
17281 AAGTGGTGAA GCCTGGGTAC CTGATGAGTA 7GGAGCCAAT GTGGCTGAAA ATCAAGGCGC 

17 341 CCCTGAAACG GGCGATTACA AATTTCCCG7 TGGGCAATTT CCCGGACGTA CAGAATATTA 
174 01 ACGGGAAAAG GCAAAGCCCC TGCGTTACG7 77CAAACCGT ATTCCTGAAA TAATTTGGGC 
174 61 AACTATGTCA AGTTGACCAA AAAATGCCCG GCAGGA7A7G TATGCCGATA CCCATTACTA 
17521 TGATCCGTTG GGGCGTGAAT ATCAGGTTAT CACGCCAAAG GCGGGTTGCG TCGATCCTTA 
17581 TTCACTCCCT GGTTTGTGGT GAATGAAGTT GAAAATGACA CTCCCGGTGA ATGACAGCA7 
17641 AAAGCTCAGT GATGCCTGTT CACTGAACAG ACATCACTCC ATTTAGGAAT GAATCATGAA 
17701 GAATTTCGTT CACAGCAATA CGCCATCCGT CACCGTAC7G GACAACCGTG GTCAGACAGT 
17761 ACGCGAAA7A GCCTGGTATC GGCACCCCGA TACACC7CAG GTAACCGATG AACGCA7CAC 
17821 CGGTTATCAA TATGATGCTC AAGGATCTC7 GAC7CAGAGT ATTGATCCGC GATTTTATGA 
17881 ACGCCAGCAG ACAGCGAGTG ACAAGAACGC CAT7ACACCC AATCTTATTC TCTTGTCATC 
17941 ACTCAGTAAG AAGGCATTGC GTACGCAAAG 7G7GGATGCC GGAACCCGTG TCGCCCTGCA 
18001 TGATGTTGCC GGGCGTCCCG TTTTAGCTGT CAGCGCCAAT GGCGTTAGCC GAACGTTTCA 
18061 GTATGAAAGT GATAACCTTC CGGGACGA77 GC7AACGATT ACCGAGCAGG TAAAAGGAGA 
18121 GAACGCCTGT ATCACGGAGC GATTGAT77G G7CAGGAAA7 ACGCCGGCAG AAAAAGGCAA 
18181 TAATTTGGCC GGCCAGTGCG TGGTCCATTA 7GA7CCCACC GGAATGAATC AAACCAACAG 
18241 CATATTGTTA ACCAGCATAC CCTTGTCCA7 CACACAGCAA. 7TAGTGAAAG ATGACAGCGA 
18301 AGCCGATTGG CACGGTATGG ATGAATTTGG CTGGAAAAAC GCGCTGGCGC CGGAAAGCTT 
18361 CAC7TCTGTC AGCACAACGG ATGC7ACCGG CACGGTATTA ACGAGTACAG ATGCTGCCGG 
18421 AAACAAGCAA CGTATCGCCT A7GATG7GGC CGG7CTGC7T CAAGGCAGTT GGTTGGCGC7 
184 81 GAAGGGGAAA CAAGAACAAG 7TATCG7GAA A7CCCTGACC 7ATTCGGCTG CCAGCCAGAA 
18541 GCTACGGGAG GAACATGGTA ACGGGATAG7 GACTACATAT ACCTATGAAC CCGAGACGCA 
18601 ACGAGTTATT GGCATAAAAA CAGAACG7CC T7CCGGTCAT GCCGCTGGGG AGAAAA7TTT 
18661 ACAAAACCTG CGTTATGAAT ATGATCC7G7 CGGAAATGTG CTGAAATCAA CTAA7GATGC 
18721 TGAAATTACC CGCTTTTGGC GCAACCAGAA AA7TG7ACCG GAAAATACTT ACACCTATGA 
18781 CAGCCTGTAC CAGCTGGTTT CCGTCAC7GG GCGTGAAATG GCGAATATTG GCCGACAAAA 
18841 AAACCAGTTA CCCATCCCCG C7CTGA77GA 7AACAATACT TATACGAATT ACTCTCGCAC 
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Fig.2. 

18901 TTACGACTAT GATCGTGGGG GAATCTGACC AGAATCGCAT AATTCACGAT CACCGGTAAT 

18961 AACTATACAA CGAACATGAC CGTTTCAGAT CACAGCAACC GGGCTGTACT GGAAGAGCTG 

19021 GCGCAAGATC CCACTCAGGT GGATATGTTG TTCACCCCCG GCGGGCATCA GACCCGGCTT 

19081 GTTCCCGGTC AGGATCTTTT CTGGACACCC CGTGACGAAT TGCAACAAGT GATATTGGTC 

19141 AATAGGGAAA ATACGACGCC TGATCAGGAA TTCTACCGTT ATGATGCAGA CAGTCAGCGT 

19201 GTCATTAAGA CTCATATTCA GAAGACAGGT AACAGTGAGC AAATACAGCG AACATTATAT 

19261 TTGCCAGAGC TGGAATGGCG CACGACATAT AGCGGCAATA CATTAAAAGA GTTTTTGCAG 

19321 GTCATCACTG TCGGTGAAGC GGGTCAGGCA CAAGTGCGGG TGCTGCATTG GGAAACAGGC 

193 81 AAACCGGCGG ATATCAGCAA TGATCAGCTG CGCTACAGTT ATGGCAACCT GATTGGCAGT 

19441 AGCGGGCTGG AATTGGGACA GTGACGGGCA GATCATTAGT CAGGAAGAAT ATTACCCCTA 

19501 TGGGGGAACC GCCGTGTGGG CACCCGAAAT CAGTCAGAAG CTGATTACAC AAGCCGGCGT 

19561 TATTCTGGCA AAGAGCGGGA TGCAACAGGG TTGTATTACT ACGGCTATCG TTATTATCAA 

19621 TCGTGGACAG GGCGATGGTT GAGTGTAGAT CCTGCCGGTG AGGCCGATGG TCTCAATTTG 

196 81 TTCCGAATGT GCAGGAATAA CCCCATCGTT TTTTCTGATT CTGATGGTCG TTTCCCCGGT 

19741 CAGGGTGTCC TTGCCTGGAT AGGGAAAAAA GCGTATCGAA AGGCAGTCAA CATCACGACA 

19801 GAACACCTGC TTGAACAAGG CGCTTCCTTT GATACGTTCT TGAAATTAAA CCGAGGATTG 

19B61 CGAACGTTTG TTTTGGGTGT GGGGGTACAA GTCTGGGGGT GAAGCGGCCA CGATTGCAGG 

19921 AGCGTCGCCT TGGGGGATCG TCGGGGCTGC CATTGGTGGT TTTGTCTCCG GGGCGGTGAT 

19981 GGGG TTTTT C GCGAACAACA TCTCAGAAAA AATTGGGGAA GTTTTAAGTT ATCTGACGCG 

20041 TAAACGTTCT GCTCCTGTTC AGGTAGGCGC TTTTGTTGTC ACATCGCTTG TGACGTCTGC 

201 CI ACTATTTAAC AGCTCTTCGA CAGGTACCGC CATTTCCGCA GCAACAGCGG TCACCGTTGG 

20161 AGGATTAATG GCTTTAGCCG GAGAACATAA CACGGGCATG GCTATCAGTA TTGCCACACC 

20221 CGCCGGACAA AGTACGCTGG ATACGCTCAG GCCCGGTAAT GTCAGCGCGC CAGAGCGGTT 

2 02 ei AGGGCACTAT CAGGCGCAAT TATTGGCGGC ATATTACTTG GCCGCCATCA GGGAAGTTCT 

20341 GAGCTGGGTG AACGGGCAGC GATTGGTGCT ATGTATGGTG CTCGATGGGG AAGGATCATT 

20401 GGTAATCTAT GGGATGGCCC TTATCGGTTT ATCGGCAGGT TACTGCTCAG AAGAGGCATT 

20461 AGCTCTGCCA TTTCCCACGC TGTCAGTTCC AGGAGCTGGT TTGGCCGAAT GATAGGAGAA 

20521 AGTGTCGGGA GAAATATTTC TGAAGTATTA TTACCTTATA GCCGTACACC CGGTG AATGG 

20581 GTTGGTGCAG CCATTGGCGG GACAGCCGCG GCCGCTCATC ATG CCGTT GG AGGGGAAGTT 

20641 GCCAATGCCG CTAGCCGGGT TACCTGGAGC GGCTTTAAGC GGGCTTTTAA TAACTTCTTC 

20701 TTTAACGCCT CTGCACGTCA TAATGAATCC GAAGCATAAC AATCATGTTC ATTCCCACTT 

207 61 TG7CATGGAT GACAAGGTGG GTTTTTCGGA TGTGTGGACA GAGACCCGTA CAGGGTCTCT 

20E21 GTCCAGTTAA TTTTTGGATC AAGAACGAA7 GGTGTAACGG ATATGCAAAA TGATATCGCT 

2086 1 CAGGCTGAGC AATAAGCTTT TCTGTTTACC ACTGATACCG GGAAAACTGA GGGTTAATGT 

2 0 941 GCCTGTATCG GCCACAGGAA GCCCTTCAAA TGGCAGGTAC TTAGCATCAT TGAAATCCAT 

21001 CTGGAATTGA CCACTGTCAT TCATGCCATG TGAGATCACA ATCGCTTTGC AGCCACGTGG 

21061 CATCATTGTA CTGCCGCCAT AACTCAGTAT TGCCCGGACA TCCTGATAAG GCCCTAAAAG 

21121 GGCAGGTAAC GTCACACTGA TTTGTTTGAT ACGGCGTGTA TTACCTAAAC CGTCAGGATA 

21181 ATCGGTAGCA ATATTCAGAT CCGATAATTT GAGGCTGGCT TGCAGTTGTG TCCCTTCGAC 

21241 GTTCAAACCG TTAAGCGTTG TGCCTGCACT GCCTTCACCT GCATTGACTA ACTCAGTCAC 

213 01 TTTATCTTTT AAAATGAAAC TATTTTCTGT CAGACCAGCA TACACTTCAG CCAGAGAAAC 
21361 GGTTCTGGTG ACCTCCAGTG CCCGTTCATC TTTTTCCAAA TAGCTTTTTT CCATCTGTGC 
21421 TAAATTCAGC ATCAGGGTTT CACCCGCTAA TAAACCCGCA TAAGTCCCAT GCCAAGCACC 

214 81 TGGTTTAATA AAGTGTGCTG CCGCATTATT CAATTCATAC TGATAAGTTT GCTCTGCCAT 
21541 TAAACAGAGT GAGACCGCCA AATCATAAAA CTGATAATAA ATAGCGGACA ACGTTCCACG 
21601 GAGCCAGTTG TAT AG CGCTG CATTACTGAA TTTACTTTGC AGAAAGGCTA ACTGCGCCTG 
21661 AGTTTGTGCC TGCTGAGTTT CCAGATAGTT TTTTTGTAAT ACTGCCGCTT CACGACGTAC 
21721 AGCCAGCGTC GCTAATTGAG CATCAATTTG TTTTATCTCA GCTTCCGCAT TATTG CGCTG 
21781 AATTTCCCAC TCTTGCCGAC GGCGACGGTA TATTTCTGAT TGGCTGATTT TGTCTGCGGC 
21841 AATACGTGTT GCTGACGCAG AAATTTCGAT ACCAATCGCA CTGGCATTGA AAAGCGCCCC 
21901 AAAACGGGAA CCTCCCACAG CAAAACCGTA AATATTGGGG ACGAGATCTG CCGCGGCGGC 
21961 GGCCATATGC AGGGCTGTGC CGCTGGTGCT CAAGACCGAT GAAGAGAGGT AAAGATCCAT 
22021 CGCTTGTTTT TCACCAGCGT TAACATCTTC GTCGTACAGC G TATTG AAA C TGTCAAAACG 
220 81 AGACTGTGCA CCAT GACGG C TT TCTT GAAG CGCCAATTTA TCAGCATCAA TTTCAGCCAT 
22141 GACCTTATCC TGCATTTTAA TACTTTGCAG GGCTAACTCA CTGCCTTGAG TTTGCAGTAT 
22201 TTCAGCCAAG GCTTCTGCAT CCTGCCGTTC AGTAATGCTG AGCAGGGTAT TGCCAAATTG 
22261 TATCAACTGG CTTACCCCCC ACTTGGCATT TTCCAGAATC ACCGGAAAAC GGTACATCGG 
22321 CATCACTGCA TGAGGTAAAT CGCCGCCGCC TTGTGAAGCA GTGATGGCAG CACTGAGTAA 
22381 CATGGACGGA TCTGCGGGCG TGGCATAGAG AGATAATGAC AGTGGCTGAC CGTCGATTGT 
22441 CAGGTTATGG CGTAAGTTAT AGAGGCGTTG CGTCAATGTC TGCCAGTAAC CTTGCAGTTT 
22501 TTTATTAATT TGAGGGAGGA ACAATGCGGT TAACGAAATT TGCCGTACGT TTCGTGGGTA 
22561 ATGCAGCGCG CTGACGCAGT TGCAGCATTT TATGTTGATA ATGATGCCGC ATTGT7TGGC 
22621 TGGCAGCTTC TTCCAGCCGT GGCTCTGACC AATCGTTATC CAATGAAAAA TAAGGCTCAT 
2268 1 CACCCAATAA AGTGAGCGCC TGTACATACC ACATTTTAGC TTCGTTTAAG GTATCACGTT 
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CAAGCT GGCG 
TAGCCACTTT 
GCGGATTGGC 
GCATTTCACC 
TCCGGTAAAA 
CAATCCCAAG 
TCATGACACG 
TGTCATTCCC 
AAAATAAAAG 
TAATCTGACA 
GGTTTCAATA 
ATTCCCAATA 
CAACTTAAAT 
CATACTTAAA 
AGTATTTAAC 
ATGCAGTGAT 
AATAACAATA 
AATATCATAG 
TTCGTGTGAA 
GCC ATTT TTT 
AAGTTTTATT 
AGTGCCGCAA 
ATAGTATGCC 
AATTCAGCAT 
GGAA TTTTTT 
TCGATGGAAA 
ATTTCATCAT 
AGGTATTTAA 
TTGAACGTTA 
CCGGTAAAAC 
TCTGTAGCAA 
CTCTTAACAA 
ATGGTCATCC 
AAA7AAACAA 
TTTTTATTGA 
CTGCCATCAT 
TCTTCACTTT 
TAAATAACAG 
TCTCCCCAGG 
CAATAATATA 
ACATTAATAC 
GTCAGATAAT 
TGCTGTAACA 
TTTTCCGGAT 
G7TTCCCAGT 
TTATTCAACG 
GTTTTCACTT 
CTCTTAATCT 
GCTTCATCCA 
AAAGTACTGG 
TATTTTAATT 
TCCAGCCATT 
ACCGGTCTGT 
AATTGTTCGG 
TCACAACGCA 
TGCA GTGCTG 
TTTTCGCTGA 
GCCACCATGT 
TCATCGAATG 
GTTTTGGCTG 
TCGTCATCAC 
ATATCCGGCG 
GACCATTTCT 
AA73CTGTAT 



ATAGGCGCTA 
ATAGTGCATC 
ATTCCAGGAC 
CTGAACCGAA 
TCTGCTCTTG 
AAATAGATTG 
ACTTGAATAC 
GAATCATGAT 
AAAGCAGATT 
CCTTCACGTA 
TCAGATAACA 
TGGATCTTAA 
GTCGCATAAA 
ACATTATCAA 
CGGGTTCTGT 
ATAACGTTAC 
TCGTTATAAC 
CCTTCAATAT 
GGTTTAGATG 
AATAAAAAAC 
ATCTGTGGCT 
TATTTCCCAT 
ATACTCCTGT 
CATCTGATTC 
CCTTGGTTCT 
TAATAAAATC 
GTCTATCATA 
CCTCATCATT 
AATTAATATG 
TGGCTAATTT 
ATTGATTGTT 
TGGCGTCTAA 
CTTCTCTTGC 
TAATGACGTC 
TCAGGTTTTC 
AACGAATATG 
CATTTTTCAG 
GTCTGATATT 
CATTGGCAGC 
TTCTGGGTTC 
TGTCATGATA 
TTTTAAAGCT 
GGTTGTTCAT 
AATAGGCCAG 
CGCAGAAGAA 
CCCGGTTGAC 
GGGCAGAAAC 
GTTGAGGTGC 
GCCATGCCTG 
CGGCTTGCCA 
TTATGAGTGC 
GCAGAGTGAC 
TGCAATGCTT 
CAGTCAACGC 
TGATCACAGC 
TGGTTTCTGA 
GTCCAATATT 
TGCTGGTTTC 
TCAGTCCTTG 
AATCCATTTG 
CGAGTGAAAG 
TAAGGACAGT 
GTGTTGTCAG 
CAGAAAAAAG 
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TCTCCGCGGG 

GGATCATGCT 

GTATCTTCCT 

TATCCGGTCG 

CAATAAG CGC 

CATTGGCGCC 

CCCTTTTATA 

CGGCATCATT 
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ACCAACGTTC 

ACCCTTCACC 

TACCAATATT 
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TTGTATCTTT 

CGC CGTCGG G 

CAACTTTTAC 

CCAC ATGGTC 

TAATGTTTTT 

GGTTGAACAT 

GTTATTAATG 

GTTATCTTTC 

ATAATCATAA 

TAGATGCATT 

CAAAGTTCCA 

ATCAAATAAA 

ATATATATTG 

ACCATTTCCT 

ATTTTTTGTG 

GACTTTGTAT 

ATCATTTTCT 

AGGAA GACTA 

TTTTTCATAA 

TATTTTATCA 

TGACAGTTTT 

CTCTTTTTGT 

TTCCTGCCAT 

AAATTGACCG 

TGTCTGGCTA 

TCCGCTAATC 

ATCTTCAACG 

CATACCTGTC 

TTCAGATACG 

CTGACGGGTT 

ATATAACTGA 

TTGGTTATCA 

ACCATTTTTG 

AAGCTGGTCG 

ATCATCAAAT 

AGCAACACCA 

ATCTATAAGT 

GTGTCACAAC 

TCCTAAGTTC 

ATGGAAGCGG 

TTGGAATTTC 

GCGCACAATC 

ATTCTCTGAG 

TGGTTTTATC 

AATGCTGGCA 

TGTTGATAAT 

GCTGTAATTA 

CCACTGGGTG 

GGCAATTTTC 



TAATCAACAA 

GGGCAACGGC 

CCAATGGGCG 

GGTTCAGATA 

TGGAATACCA 

GTTTGAAATC 

TTTTTTGATA 

AGTGAATATA 

GTCATAGATA 

TTTAGCATAG 

ATAAGGTTGT 

ATCACCATGC 

TAATTGCGGC 

GGCTCTTTCA 

CTGCATATAT 

GGATTTTAGT 

TTGCTTAATA 

TTGATTAAAA 

TTCAGCATTT 

ATCTTGGATC 

AAATACACCC 

ATTGAAACAT 

CAATCTAATA 

TTTATACCAA 

AACACTCTAA 

TAATGAAAAA 

ATAACCGTTT 

CCTTTTGAAA 

GGTGATATAT 

GTTATAGATT 

TCTGTCCTGG 

GTGAGAATGG 

T7AAAAGAAT 

TCAGAAGAAC 

GTCACATTAA 

AATATATAAT 

TCCAGCCACA 

ACATTGATGG 

TGCTGGCACT 

TAACCAATTA 

ACCTGCAAGT 

GTATCGATAT 

TGACCAATAC 

CCGGCCCAGG 

TTCACTGGCT 

ATGCTGGCAA 

ATCAGCAGAT 

ATGTAGTAAG 

GATTGTTGAC 

GTTGGCATCG 

TCCGGGGTAA 

TCTCCAGTTG 

CTGAGCATCA 

CAAATGCTGT 

GTCAGCGCTT 

TCCGGTTTTG 

AGAGAAAGTT 

CGATCACGGT 

TGATTAATCC 

GCAATCAGCG 

CCATTACTTA 

TCCGTGGTCA 

CATTGGAACA 

GTGTTCACAT 



ATCCAGCATT 

GTCCGGATCG 

GACGTTCCAG 

TAGCGCAGCC 

TCATGGGCGT 

CATGGGTTCA 

TTTTTTACTA 

AATTGATTTT 

ATTTTTTTGT 

GGAACAAAGA 

CTGGCAGAAT 

TCCTCTTTAT 

TCTGGTAAAT 

GCTAATTTTC 

TGTG7GCCTG 

TTTATATGAA 

ATAAACTCGC 

TCATATACCA 

AACTCCACTA 

TGTTCGATCA 

ATGGATCCTC 

CATTAGTAAA 

CTATGTTAGT 

CTCCAATTTC 

AATATTCGGC 

CTTCTTCTTC 

CATCTTCTAC 

AATTAATTTC 

ACGAGAGATC 

CCTTATATTC 

TATCAAGTTC 

ATAATCTCAT 

AATTGTCTTT 

AATACATACC 

AATTAAACGG 

CAGTGATATC 

GTAAATACAA 

GTATTTCAAT 

TTTGGTGATC 

AATAAGTGAG 

TAGCGACATC 

TTAACTGACT 

GAATCGTGGG 

TGCTATACCG 

TTGATACTTT 

TGGCTTCTGC 

AGCTGTACAA 

CACTGGCCGC 

TGTTCAGTCC 

GGGTTTCCGG 

TACCCAATGT 

GTAAAGGTAT 

AAATTTTAAC 

TAAGATTCTG 

GCAAAGTGGG 

TCACCAACAG 

GCCCCAGTAC 

TAGCCGCAAT 

ACAGCAAAAT 

GGGCAGCTGC 

GTGTCGTGAT 

TCAGAAACAC 

GAAAGCTGAT 

AGGGAGAAAC 



TTCATAAAGG 

ACCGAATCCA 

TAATAATCCT 

AGCGTGTCGA 

TGTAATAGAA 

GTGTTATTTT 

TCCCCTGTTG 

TCGTCTCATC 

ACCCAACCCC 

GCGTTACTGT 

TGCCATCAAT 

TGTA GGGGGG 

TTTGCGTTTC 

TGGAAA ATAA 

ATGGCATTTT 

TTGGCGATTC 

TCACCAGAGG 

TAGGGTCAGA 

GAATATCAGA 

TAGATGAAGC 

GCGAAGGAAC 

TGATTCACAT 

ATCAAGTTTG 

TGATTTTCTA 

ATTTTTAAGA 

TTTTCCAAGC 

CATCGATAAC 

CATTGAAGGA 

AAAAATATTT 

GGCCAAATAA 

TGATAATGTG 

ATCAGGGTTA 

TTTCTCATGG 

AATGCTGGCT 

TGAGCTCCAG 

TATCTTGCCA 

ACGAGACTTG 

TTTTTTCCAT 

GACATTGCGC 

CCCCTCATTG 

TTCAAATGCG 

TTGGGAAAGT 

GTCGATATAG 

TCGATTGTAG 

TCCTTCAACA 

CACACGGGTG 

CTCATCCCGG 

TGTCGTCGTG 

CGCCTGCAAC 

TTCACCGACA 

AGCAGCGACA 

TCACTCCCAA 

GCCACCGCCA 

TCGCGTAGCT 

GAGATCATGT 

GGTCAGTTCG 

CTGACAAAAA 

AATCATGAAA 

AGTTTCTGCT 

ACGGATCAGT 

AAGGTTTTCA 

ATCACTGACA 

TAATTGCGTT 

CGACAACAAC 
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Fig.2. 

ATGGATAATT 
AGAGACAGGT 
CATAATGTAT 
AAATGGTTTT 
TGGCCATTTA 
GAATATTGGC 
TGCTGCATAT 
TCATCATAAA 
TACAAACGAA 
TAAAGTGTGC 
TGTTATACCG 
AGCAATGGGG 
AAAATCCAAG 
TTTGATTGGA 
CTTAATCAAT 
ATTCTTATAC 
TTCATCATAA 
ATCATCTGTA 
TACAAGGATT 
GTCCATTAAT 
TAAAATTGCC 
GTAAGTGGAC 
ATGCAGCAGT 
TGTCAGTGCC 
AGGATGAAAA 
GGAAGCAATT 
ACTATTGTTT 
AGCACGGGAC 
TTGATATAAA 
GCTGAAAGAC 
ATCTTATCTA 
TGATTAATGG 
ATTGATACTG 
ATGGATTTAA 
AAATTCAACT 
ATGACTAATA 
GCTACGTTGG 
CAAAAGTTGG 
AACCTCGCAA 
TGATTACTCT 
TTTCCCCGGA 
CCAGCGTTGC 
TTGTTAAACA 
AGGGATCGCT 
AAAACAGAAT 
ATGGGTCTGA 
TGGAACAGTG 
GAATATAGCC 
TGACGAGGAT 
AAGTTAAAAA 
A CGTT GACGG 
CCTTTAAACT 
ATAACAATTT 
TACACCCATT 
TTTGACATGG 
ATGCAAGGAT 
GAGAAGATTA 
GCAATAGTTA 
ATCTAACAAT 
TTTCGTTTTC 
AACACGCTTT 
GTC CACCT CA 
TGATTTTTGC 
TTCGCCAAAC 



CATTCACTGT 
TCTCGATGGA 
ACAGTTCAGT 
TTGATTCACC 
ATAGCGGTGT 
TGATATCTGA 
AA TATT GAAC 
TACTTTCTAT 
TCGCTTTATT 
ATTGGCATTG 
TTGGTGATTT 
ACGAAATTTT 
TGGTCAGGTT 
ATTAGCTCTG 
CTTGCCGTTG 
TGTTGATTTG 
GACAAATCGT 
CCGAAATTTT 
TGATACAATT 
TCCGCTACGT 
TGACGGGCTG 
AACATTTTCA 
AATTCATTAG 
A GTGAA GCAA 
TCTTTCGCTT 
GATCCCGGTT 
AGGGTTGACG 
AAAATGCGCA 
AAGTCTGTTT 
ATAAACGAAA 
ACAGTTCATT 
TTTTTAGGTG 
ATTTATCGCT 
ATAATGAATA 
CAACAATTGT 
TTTATCTATG 
AGTCAGATAA 
TATATCGTGA 
ATGCGGGGTT 
TATTCTCAGT 
CAGCACAGAC 
TCAAGCAAAA 
GGTTGGTCAC 
TATTACGGAC 
TCAGGGATAA 
CGGCCACTCC 
TTGAAATTGA 
CCGCACTCGC 
CAAAACTAAG 
TCAAAACTTA 
ATGTAAATAT 
ATATTCGGGG 
TGGTTGTGAA 
TTTCTCATCC 
TTAAGCAACT 
TGCCATAGAC 
AATTTGGGAT 
TCTAAAATTA 
TTGATAAATA 
GCGTTGAGGC 
TGTTGCCCTT 
TCCTCATAGA 
CATTTTTTCA 
GATGCCCGTC 



9/12 



CAGATGATGA 

ACACATAAAT 

ATCATTCACT 

GGGGGTTAAA 

ATTGAACAGC 

ATGACACAAT 

ATAAAACAGC 

TACTTGCCAG 

CAGCTTTAAC 

ATAGCATCOG 

GCTCTGTCGT 

TATCTTGGTA 

CTG TTTTTTT 

CATAGTTTAA 

CCGTATCATT 

TATTTTTCTT 

AGTAGCGAGC 

TCTTCATCAG 

CAGGCGATAT 

CTGTATTACG 

ACTCATACGG 

TTACACCGTT 

ATAAGGATAA 

TGTCGGGGCG 

CCCGATATAA 

TTACAAAACG 

GTGTAATATT 

GTTCTTCAAG 

CTCGCCACGT 

TGT TTGT CAA 

AACTTTTATC 

GAGAT TATT A 

CTATTCTTTC 

CCGTATGTTA 

TTAAATATTT 

AAAGATTATT 

ATGTGTGCAA 

CAAGAGTGAT 

TTTCTTCGCA 

ACCCTTTCTT 

TATACTCAGG 

TCTTTTCGTG 

GAACTCTATG 

TTATCCGGAA 

CAGTGGTTCT 

AACTGACAAA 

TGTCAAAACT 

GGGGTTTTTT 

TTAACGGCAG 

TTTTTTATTT 

ACAGTATTAT 

GAAATTATTA 

TATA AAGCGG 

CCGGTTTTTG 

GCCACA TAAA 

GTTCAATTTT 

TCTTTGCCAG 

ACGTGATGGT 

AATCTGAGTT 

AATTGGCAAG 

TGAAGCACCA 

AGACCGGGTG 

TCATACTCAG 

CGGCAAAAGT 



ATGTCTGCCA 

TCTGGATTTG 

CTGAAAGCAC 

TCCAGTTTGG 

ATTGTAAAAT 

ACCAGCGCAT 

TTACCCAACA 

ATATCTTCTG 

AGGA ATATA T 

ACGGATTTGG 

CAATTTAATG 

TATATATTCT 

TACACTGAAA 

ATGTGAATCG 

CCCGTCATTG 

ACCGAAGGAG 

CAAAGAAGCA 

TTCTGTTGAA 

ATCAGTCTTA 

GCTAAATGCG 

CAGATGATAG 

ATAGTCAGTT 

TGTGGAAATT 

TCGTTTATTC 

TTCTGTTAAA 

GTGGGCGCGG 

AAGGTTAGTG 

TTTATTCTGT 

CAGAGTTCCA 

TAATAAAGTA 

ATATAAATCC 

TAATCTGATA 

AATAAAAAAT 

AAAATTAAAT 

TTAATTGTGT 

TATTGAGGAT 

AAAGAAATCC 

AGTAATGTCA 

TAATCAAAGA 

TTGGTGCTTT 

GTGGATTTAA 

ATGATGCGTG 

AATTCGCGGC 

AGCTATCTGG 

GTTTATGTTG 

GTTCGTATCG 

ATCCGCATAG 

GCTTTCTGGG 

TGGTCACTGA 

AATAGAGGAA 

AGTCCTTTGA 

TGTCAGATGT 

ATTTATTTAA 

CTGTTGTAAG 

TTGGCAGCAG 

ATTCAACCAC 

CCAAACCCTG 

TTTGGCATTA 

CTTTCTCAAG 

GTAGTGTTTT 

GTCTGCACCG 

TTTCTCTTGA 

GGTCAGGCAA 

AGCGATAGAG 



GCAGACGAAC 

TTCCGCCATT 

GTTTCATTAT 

TATTATCAGC 

GACTGGGTTG 

CGCTGACGCT 

CATTGCTGTC 

GAGATATGCC 

CACCGGGAAC 

TTAACTCGCC 

GGAATACTGT 

TTATCTCCAT 

TTATATTTGT 

TAGAAATCTT 

ACCAATGTTA 

AGATTGACAA 

TAACTCTTAA 

TTTTCCGGTG 

ATAGCCAGTA 

GTGAGGTTTT 

GGTGTCATGC 

TTCTCTAACG 

TCTTCATCCA 

AGGTGATATT 

TAAGCCGCTG 

CCATAAAACC 

ATATTAGCCA 

TTTGATTCCT 

CTTGTCCTAT 

TCACCAGCCT 

TTAAGTTATT 

GGAATATTAT 

AAAGAACTTC 

TTTAACAAAC 

TTGTGCTGTT 

GTCTTGCTTG 

TTAATAAAGT 

CATAATTTAT 

GAAAGCTATG 

GG CACAGCAG 

AGGTCCAACT 

GGTTGTTCTG 

CGCATAATAC 

AACCCCTGTT 

ACATTGATGA 

AAGGTGAAGT 

TGAAATAACT 

AGTCGGAAGT 

TTTGGTGCAT 

TGTCACCCTG 

TATGTTATTA 

TCGTA ATAT T 

ATAAGTTTTC 

GAAGCGGTTT 

TGGTTTCGTG 

GGGCAATAGG 

ACCTTCCGGC 

ACATATTGAT 

CTACCGACAT 

TGGTTCTTTC 

ATTTTCGGGT 

GGCAT TGGAT 

TTTTACGGTT 

GGTACTTTGA 



GCGATAAAGC 

AGCCAGTTTC 

TCCCAAATAA 

AGAAAACTCT 

TTGTTTAGTG 

AATATTATAG 

AATGGTTAAG 

TGTGGCTTTA 

TCCATCATTT 

ATAAGCGGAG 

AATGGGTATT 

TCTGGAGACG 

ATTCATTTTC 

TGCGGGTTCG 

TCAGTTGCTC 

ATAAACTGAG 

AAATCAGTAC 

TAATTTCTTC 

GCGATGTTGG 

TATCTTGCAA 

CGGTTTGCCG 

TCTGAATATT 

TATTATTCTG 

GAGAATTGTC 

GTGAAAATAT 

AACTGTTGTA 

GTTGTGGATT 

GATGAGCCTG 

GACGAAATTC 

TTTTCTATTT 

GTCAATTTAA 

GGTTAATTAA 

CCTATAATAC 

TTTCATGAAA 

TGAAAAATGA 

GTTTCAGGGG 

TGCGTAATTA 

TGAATACCCG 

AAAAAAACAC 

GGTGGCTTCG 

CCCAACCTGA 

GAAGGAAACA 

GACTCACTAT 

ACGCCTGAAT 

TAAGCGCTGG 

GGACAA AGAC 

CAAGCACTTT 

TTAACCGTAG 

AAGTTATCAA 

TAGGTGAATA 

AATTGAAAAA 

ATTAATGTTG 

ATAATTGTGA 

CCATGAAGAT 

TCACGGTTTC 

TCGGTAAAAA 

TCTTATGAAT 

TGTTAATTTC 

AAGTGATTTC 

CGGGGGTAAC 

TCAGGTTGAT 

AACGTCTCGC 

GGTGCCGCCC 

GAGAGCGATG 
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Fig.2. 

TATTCAGTAG 
AGCCAAAATG 
AGTTCCAGAT 
TTAACCAATT 
GAGAAACCGT 
TATCCCGGGT 
TTATGATTGG 
CGCGAAAATC 
GGAGTCAGCA 
TGATTCATCT 
TGGCCATTAT 
GTTTCAACAT 
AATTATG TCT 
CGGGAACCTT 
AATTAAG TTT 
TTTATGGTTT 
TTATATAGTA 
TAACAGAAAA 
GAGCTGTATT 
GTCAGCAGAT 
AGTAAAGGCG 
AGCCTCACCA 
TTTTATCCCT 
GG AACCGGCA 
GTCGAGACAA 
GATCCGTTAA 
AAAAAAGGAC 
AGGCCTTACC 
CACCTGTATA 
CTGACACATA 
ATGCTGGAGT 
CAGCGCCATG 
TTTGTCCATC 
GAACAGCTTG 
GCTGAAGGAC 
AGTCTGGACA 
CATTAAATGG 
ATTTTATTTA 
TATATAACGG 
CGTGAGCAGC 
CCAATTTCAT 
AAATCCAGTG 
CCAGTTCAAA 
GCAGACAGAA 
TATAGCGGCG 
GGTGAATTTT 
AGTAATCCAT 
TGGGGGGATA 
CGTAGGTCAT 
CCATTTTGGC 
AGGCGATGTT 
GTTCCGGGTC 
CCACCGCAGT 
GCACGTTGCT 
ATACTGAATG 
AATCCCCCCA 
CTGGCATCAC 
TAAGGATCAA 
TCTGCTTGCC 
TCACCGTTGA 
TCCAGATCAA 
TCTGGCGACA 
AGCGGTTCTG 
GCTGAAGAAT 



CTCATTGATT 
TTGTGGCGAG 
GGCAGGCCTT 
TACCCAACGA 
ACT CCCTTCA 
TTTCTGGATA 
CATGACTCAG 
GGACTGAGTT 
AATGAGTTAT 
ACCGGTGGTA 
ATCAAAGTTA 
GGCAGTTATG 
GGTGATTCAG 
CCACTGATGG 
ATATTTCATC 
TATATTTAAT 
AATAAATTCT 
TTCATGGTTA 
TACTGTAGAA 
AATATGTTGT 
ATTAATAACC 
TGACGCGTTA 
TTATCTGCCG 
GCTTTGTGGA 
CCCACGGGGA 
TGGCCTGGCG 
ATACTGAACT 
CTTACTCAAA 
ATCAGCCCCT 
AAAGCATTGC 
GGGTTCCCCA 
TTGTGTTAAG 
AACTGACTGA 
AACAAAAAGG 
GGGAAGAAGG 
TCATTGTCAC 
ATACGCTTTT 
CTACGATTTA 
TCCCATATCA 
ATTTGCCAAC 
TTTGGTTGCA 
ACCACCGTCA 
CTGATTTTTC 
GAATTCCGGA 
CGTTGTCAGA 
CTCCGGTTGT 
GTCCCGATCA 
CAGGTTAGTA 
CACAAAGATA 
AACGACGGCG 
CAGTGCTTCA 
GAATTCATTA 
AAACATGGGA 
CAGGATCTTT 
CGAGTTCCAG 
GTAAACCGGA 
GGCGCTGATC 
CGGGTACAAT 
GGTTCCACCC 
CGGCCATAAA 
AACCACGGCC 
AACGCGCATC 
TATTTTCCGG 
AACTCAAAGG 
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TTAAGTGTAA 
TGCTGTAATA 
CCCGCCGGGA 
TGAACGGAAG 
TGTAACATCA 
GCTTTTTTCA 
TCCATTTTGG 
CCCTTCAAGT 
TCCCCATAAT 
TGTGGATTCC 
CTTTCAGTAA 
TTTATTTTAC 
CTAAAGGCAA 
TATTGAATAA 
TGGTTTCTGC 
GCCAATCATA 
GTTGGATGTG 
GGAAATTCAA 
CTCGCATTGA 
ATATTGGCTG 
GATAAAACAG 
TTCAAACATT 
GAAGCGATCC 
CAGGCAATTA 
CGGTTACATT 
GCTGATGTAT 
CCCnTGGTC 
TCGATGGCTG 
GCCGTTGGTG 
CTTGATGGAG 
ATTGGTGGCG 
CTATATTTTA 
ACAATCTCCi 
GCGTGAGCAA 
CAAGCTGGAA 
CAGTACCGGC 
TCACAGCAGG 
CGACGGGTTA 
ATCTTCTCTT 
AGGCCATCAT 
TAAATTCCCT 
GCATTAAAGA 
CCGCGTGCAA 
GCACCTTTTT 
TCAGCACCCA 
ACACCTTGTG 
GGATTGGGCG 
TGGTGACCGA 
TTGTCTAAAT 
CTACAGGCTA 
CGCAGCTCTT 
CCTTCTTCAC 
AAACGCCGGG 
GGCCATCACA 
CTTATGCCCT 
GGCTGCATCC 
CGCGTCCAGA 
ATGGCCTAAT 
GTCAACAACC 
ACTGAAAATC 
GGGGGCATCG 
ATACTGGCAC 
ATCAACTTCA 
AGTTCCGCTG 



TAAGCTCAAG 
AGAAAGAAAT 
GGCTTTTAAG 
AACGTGAACA 
AGAGCGCGGT 
TC GGACGTCG 
GATTTGTTTT 
GATCTACTAT 
ACCTGACCAT 
TTGGTGCGAT 
AAAGGACGCT 
AGACAGTAGT 
AGGGGATTAT 
ATATTGATTC 
AATTAAGTTT 
TTATTTTTCT 
ATTA TTATT G 
TCAACTTTTG 
TACTGGATTG 
TGGATTTTTC 
AGAGA CGGAT 
TTTTAACCCA 
GGTCAGTGTG 
CGTCAGTTGC 
TATTGCCTGA 
TATTCGCTGT 
GTCCCCCTGC 
GATTGTTTTA 
GATATCAGTG 
CTGGTACAAA 
TTGTTGAATG 
CTGAATGGAC 
TGAAA 
GG Z CGG A CAG 
ACGGCGCGCG 

'GGTGAC 
CT7TAGGAAG 
TTCCGCGTAC 
CCTGATCGCC 
TATGCAGCAC 
GTGCGTCAGC 
TTTCATATTC 
CCATCGTGCC 
GACATGAACG 
ACAGTAAAAA 
GAGGAGGGTT 
TGTATTCTGC 
AAGGTGCGAT 
TCGTGATTTC 
TCACTAACAA 
CTGTGGCGCC 
AAGAAGTCGA 
GAGAAATACC 
GCCTGTTTTG 
TGATTGTAAT 
CCGACATTGC 
GTAATAGGGG 
TCATTAATCC 
AGGCGGTCGT 
TCGCTGGTCA 
CAGTCAGTAA 
TATTCGTTGT 
CCGTCAGGTT 



GCTCCATCGT 

GACTGTGAAG 

TCCTTCCAAC 

GTGAAGCGTT 

GAAGCGACGT 

TTCATTTCGG 

GATTTGGCGA 

TTTGAAATCT 

GTGGTTGTTT 

AGTCAGAAAG 

GCTGATATTG 

AAACAATTTA 

AAGCTTGAAA 

ATTATTATTT 

TAAAAATTAA 

TATAATAATT 

TGAG ACGG TA 

TCCGGTTTCC 

ATTAGCCGGA 

AGCGAGATGA 

TGTGGCCAGG 

ACCAGAAACC 

TGATTTACCA 

ACAGTGATGT 

TTGAACACCA 

CAGCCATGGC 

TGTTTTATCA 

CACTCTCTGA 

CGCTCAGTGA 

AACATATCCG 

CCGGTTATAA 

ATACGCTGGA 

CCATGTTGAT 

AAGGCAGAAC 

CATTATTACG 

AGAAAATTGA 

CCCTGTGAGG 

CTGAATGAGA 

AGGTAAGTAA 

TGACCAAGAG 

AGTGCGGGGC 

GTCGGTTTCC 

CGCATCGTAT 

CAGTGGCTCT 

TCCATAGTTA 

GCGGATCGCC 

ATCGCCGTCA 

CC AACCG GTA 

TTCTTTGAAG 

TTTACGGGCC 

AACATAGTTT 

GGGGTATTCC 

CGATGCTACT 

CTGACA TACT 

CTCGCGCTTT 

ATTGCAAGAA 

GTGTGGTGCC 

CAATCTGGCC 

GTTCGGATAA 

AGGCGGTAGG 

GCGCAGTGTT 

TATAGGCAGA 

ACAGGGACTT 

TATATCCCAC 



GAACGGAGAT 

AGCGGAGCTA 

CCGTATAATG 

CTGGAAACGT 

GCATAGTCCT 

GGTATTGATG 

TTAATCAGAT 

TATTTAATCA 

ATCCGGGAAA 

ATATTGACTC 

TGAACTACAT 

GCAATAAGCA 

TTAAAACAAA 

ATGGA TAAGA 

TTCTACTTTT 

GATAGTTTAT 

ATAATTAACA 

TGACCATGAA 

CGAGTGTTGG 

TAGCTTTGGC 

AAAGCAAAAA 

GCCCGGGAAT 

CACTAAAACT 

GCTGTATTCT 

GTCCACGCCT 

TGCGCATCTG 

TGGTGAGGTG 

ACACGCGGCT 

TGAAGAGATC 

TTGCCGGGAT 

TAGCGCCGAA 

TCTCGCCCAG 

GACTATTGCA 

AGAAGGCAGA 

GCATGGTGTC 

AGCGTTAAAG 

CCAC CGGA AA 

CGTCCTTTGT 

CCCAAACCTT 

AAGATCCCGC 

GTATCCAGTG 

GTGTCTGTCA 

TGGTTATTCA 

CCTGTTCTGT 

GCAAATCCGA 

TCATCTGCCG 

TATTCATATC 

CCAAAGAAGT 

CTGGACTTCT 

CGGGTTCCAA 

GGGCCATCAT 

CAGTCGATAT 

CACAAATGTA 

CCAGCCGCCG 

CAGATTACGC 

ATTCTTCGGG 

TAAATCACCA 

ACTGCTGGCT 

CTTGCCTTTG 

CGGGATTTTT 

ATCCTGGGTT 

GACTTTAGGC 

GGCAACACGT 

CTTCTGATAG 
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Fig.2. 

GTTTCTTCTG 
GGCGTATTGG 
CAGGCATTCT 
TCAACCCAGC 
CCAGATTGAT 
GCTTTAATTG 
TATTCCGGGG 
AGACGGCTAC 
TAACGCAGAT 
AATTCCAACT 
TCAAAATTGA 
TTAGCCTGTA 
TAATCGATAG 
GCCCGTACAT 
CGCCGATCAG 
AATGAACCCG 
ATGGTTTTTA 
TGTAAGGCTA 
CCACTTTGGA 
AGTAAACAGA 
TCATCCACAT 
CCGAAGCATA 
TCCGCCACAG 
GATTGTAATG 
TAGTGTTGCT 
TGTTGCATGT 
TGCTGCATAC 
GTGCTGCCGT 
AAACGATAAT 
TGGTAACGCT 
AACTCTTTAC 
TGCCACAGTT 
GAGACGGATG 
TCCTGTGCCC 
CAGTTTGCCG 
ATCTGCTTGG 
TCTAACAGAT 
GCTGCAATGG 
TCAATAATTG 
TCATCTTCCA 
TCTATATTCA 
ATCCCATTTG 
GT AATAAGCT 
AGTTTTTTTG 
GCGGGGAAGG 
TGCATACCGA 
GGCTTAAAGA 
GGATAGGTTA 
TTATCGTGCA 
TTAA AGTCA G 
ATCAGTTTGA 
TAGGAAAGCG 
T GCCTT AAGG 
CCTTTTCGTC 
TCGCAGACAT 
TGAAGACGGA 
ATCACAGCAA 
TACTCCAGCT 
AG GAAC CCAA 
ATCTTTAGTT 
GAAAATGGGG 
CTCGCTCCAG 
TGAACGCCAG 
GCGTTTGATG 



TGAGTGCATC 
GGTTACCGTG 
CATAAACCGG 
CGATGTTTTT 
TCTGCCAGGC 
AAGGGTATCG 
CCGGCTCCTG 
GAACCTGCTG 
GGAGGATAAT 
TCCACTGTGA 
GCCTGAACAT 
CGTTCAGCAA 
TTTTTAAGTC 
CTTCATAAGG 
CGTGGCTCGG 
TACTCTTGCC 
TGATCTCCAG 
ATTTCAGATG 
CGAAATTCAT 
GGGAATTAAC 
GTATAACGCA 
CCGCCAAGAC 
CGGCCGCAGT 
CGATAACTTC 
GGCGCTCCTG 
CAATCGCTTG 
GGAAGTGTTC 
ACTGAATCAG 
TCGGGATTGC 
GCCTGAGCAG 
CATCCAAGGT 
TGAGATATTC 
TATCATATGT 
GTGCTTCACT 
CGAGCAGAAT 
CGACAGTACG 
TCTTGACATA 
CATCCGGATC 
GCCGTGAATT 
GCGGATTAAA 
GTCGCCACGC 
CGCCATTTAA 
GTGTATTCCA 
CAAACAGTGT 
AAAGGAATTG 
TTCTGAACTC 
GAAGTTCGGC 
AATCAAGAAC 
TCAGCCGGAA 
ATTGAG CGAC 
TACTGTCTTT 
TTCTTCCCGG 
CATCCTTGTG 
CATCAGCATA 
AACCGAAGGC 
CAAACGGAAC 
CCATCTGGGC 
GCCATCATAT 
TCACCCGCAC 
TCAGACTGTT 
CGTATCAAAT 
GCATTGGGAG 
TAATATTGGT 
TTAACACCAT 
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ATATTGCAAT 

ATCGGCAATT 

TAAATCAGGT 

AAAAACCGCG 

AACCAGAGAT 

ATAAACATTT 

ATATCAGTTA 

GCTGAACTCA 

ATCGCTCAGC 

AATGGCGCCT 

GCCGCTGTCT 

AACGTTTTCG 

AGCAGTACTG 

CCCCAGCAAT 

ATAATCGCGC 

AATTTCCCAC 

ACGTCTGGTG 

TTCTCCGACC 

ATCATACTGA 

CGCATCATAG 

TCATTGGTAN 

CATCCCCCCG 

GGCGG CTGAC 

CTGCTCGGTG 

AGCGGCCCGT 

CTGTTGCAGA 

AAAATCGGTA 

CGTTTCTGCG 

CCGGCGTTTC 

ATCTTGCGGG 

CAGGTTATGA 

CGTATCAACA 

CACAGGCAGA 

GTTTTCATAC 

ATCAGGGCTG 

TGCACTGGAT 

GAAACGGAAT 

GGTTGGTTCA 

CCAGTAACGC 

AATATAGTGC 

GACCAGAAAT 

ATCAATCGGC 

GCTCAGTACC 

ATTAAGGCGA 

CACCTGATCC 

TTGAGTACAG 

TTTCAGGGTG 

TTTTTCGCTC 

AGAACCGTTG 

AATCTCCAGT 

GCCGAAATCA 

GTTGCCGGAT 

TTCACGTGAG 

TTGGTCATCC 

TTCGTCATAA 

CAGAGCCAGA 

CATCCGGTAT 

TTGGCATAAG 

TAGGCTCAAC 

CTTCAACTTC 

TGGGGTCTAC 

ATAACGCATC 

ATGGCTGTGT 

CTTCATAACC 



ACCTCGGTTT 

TCTTCCGGTG 

GAAATATTGC 

CTATCATAAA 

GCGCCTACTT 

TGAGACATAA 

GAATTGTCTT 

TTACTTCCGC 

GACTCCAGCA 

GTCCCTTCAA 

T CCATG GCCG 

GGTTTGGTGT 

TAAAGCGTAT 

GCGGGCAATG 

AAGAACATTT 

TGTGATGATG 

TTATGTTGCA 

AGCAGCCCCT 

CCTGTTTCGT 

GCTTGCAGGT 

ANTTGTTCNN 

ACGGCCAGAC 

TGGGCAGCGA 

ATGGAGATGT 

CGGCTGATGG 

TTGC GGGTAA 

TTGTCTTTTT 

GCCTCTTTTG 

ATGCCCGCCA 

CTGATGGGTT 

CGTAAGTTAT 

GGTTTGACAA 

AGTGGCACGT 

AGAGCCACAT 

GTACCCAGTA 

GTCA GCTT AC 

ATTGCTTTCC 

ATTAACATCC 

GGTTTACCTT 

AGCCATTCGG 

GGCATATGGA 

GTAGGGAATG 

TGCGGGATAC 

ATGTTTTGTG 

TGTTCATTGA 

CTGGCACTTT 

ATTCGATTAT 

AGTACCAGTG 

TAATATTGAT 

GTGTCATCAG 

GGGTTCATTC 

AGAGCACCAT 

TAATACCAGA 

GGCAAATCAG 

TCATAATCCT 

AACGGGTTAT 

TGCAGATGTC 

CGATTTTGAT 

GTTTTGGTTA 

CGTCCAGGCA 

GCTGCCCAAT 

GGTATCAGGA 

ACGGGTACGT 

TGCGATAACT 



TTTCTCCCGG 

TCGCCTCACG 

GGTCGGGAAT 

TGACATACCA 

CGCTGCTGGC 

TTTCACTTCC 

GTTTTAATTG 

CACTCACATC 

GCTGATCCTG 

AAGGCAGGAA 

TTGAAATCAC 

ATTCCAAGGG 

TGCTGAGTTG 

ACAGCGCTAC 

CGGCGCTCAG 

TCAGTAATGA 

AATACGCCTG 

GATAAAGATC 

ACTGCCAGGA 

AAAGCCGGAG 

NNNNNNNNNN 

CGAAAATATT 

TCACACCTTC 

TTTCATCATA 

TCAGTGCATC 

AGCTGTACAG 

TCTCCAGCAA 

CCCGGCTCAT 

TACGATTAGC 

CATCGTATAA 

ATAGACGCTG 

ATAAATCAGA 

TGCTGACAGT 

CTTGCAGCGT 

ACATATTGAC 

GGTATTCCAT 

GGTAGTGAAT 

GGTACACGGT 

GGTTGCTGGC 

TGGCCTCTTT 

AAAACAGTTC 

AACCGGGTAT 

CCTGACTGGC 

GCGCGTTATC 

GTTTAATCAG 

CATTGCCAAC 

CCGACCCCAG 

GTTGTTCATC 

GATCTTCTAT 

TGC CATG AAC 

CGGTTTGGAT 

AGTACGGTAA 

CCAGGTTGCC 

TAATTTCTAC 

TACCTTTCTT 

GCGGGTCTTG 

TTCGCGCAGA 

CCGGTCAGGA 

TGCAGTGATA 

ATATACAGGC 

GGCAGGTCAA 

TGGCGTATCG 

CCGACAAAGA 

TTCAGGTTAC 



CGGTACATCA 

GACATATTGC 

ATGCCAGCGT 

GGTTTGACCA 

GTCAGACATC 

GGCCCCGTTA 

ATGTTTATTC 

ACGCGCGGTA 

ATCGGAACCG 

AAGTTCATCA 

CACACCTTGA 

GTTAAGCAAA 

TA CCAGT GAA 

GGTTTTTATA 

TAAGAAAGTG 

TTTTACCGAT 

ATCCATCCGT 

ATTCCAGAGA 

G GCTT CGGCC 

ATTTGGCTGA 

NNNNNNNNNC 

GGGAACCATA 

AGCCGCTCTT 

GAGCGATTTA 

CAATGAAGCC 

CCCCAGTTGC 

ACTCAGTAAC 

GATCGGGGTG 

CACAACACGC 

TCCGGCCGGA 

ATCCAACATT 

CGGTGCGGCA 

AAGCATTAAC 

ACGGGGTTGC 

GGAGTCATAG 

GTCTCCCTGA 

GGGTTCACTG 

GGGTGGAGGA 

CTGAACAAGT 

TAATCGTTGT 

CCAGAAATAG 

AGGCTGTTCG 

AATGGCGATC 

AGTTTCATCT 

TTCGCGAATA 

ACCACCTTTG 

CTTGATTGAT 

CAAGACAGTA 

CGCACCAAAC 

AAAATTGACA 

TCTCCGGCAA 

TCGATAGGAT 

GACATATTTT 

CAGCAGTGTA 

ATCTGTCCCC 

CTGTATATCC 

ATGGTGGGTG 

ACGGTGTGGG 

ACGCAGTTGT 

GATTATTCAG 

TAGGTTTCCA 

AAAGATTCAG 

AGAACTTATC * 

TGACATCTTC 
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Fig.2. 



38101 AAAATTATTC AGATAACCGA GCACCGCTTG TTGTACAGAA TCTTCGGTAA TTTTTCCCTG 
3B161 ATTAAGGGCA CTTTCCAGTT GGAAGAAGAA TTCTGTTTTA TTCAGGCGTA ACAGGGGTTC 
38221 CAGATAGCTT TCCGGATAAG TCCGTAATAA GCGATCCC 
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